OTIC  HlE  CUKJf 


2500  Mission  College  Boulevard 
Santa  Ciara,  California  95054-1215 


AFOSR.llt.  8  8 


:-S0e  'JC.  *  Vie/  55963!  •  FAX:  ,406)  960-0400 


0  848 


Annual  Technical  Report 
1  Jun-j  1986  through  31  May  1987 

(VI 

o 


00 

o 


®  Adaptive  Control  Techniques  for 

^  Large  Space  Structures 

l 


Q 


Santa  Clara,  California  95054-1215 


Prepared  for; 

AFOSR,  Directorate  of  Aerospace  Sciences 
Bolling  Air  Force  Base 
Washington,  DC  20332 


Attention: 

Dr.  Anthcny  Amos 
Building  410 


DT1C 

electe 


ix  AUG  2  5  1988 


Under  Contract  No.  F49620-35-C-0094 
1ST  No-  Ho 


88  8  25  09Q 


UNCLASSIFIED 


'.ECURITY  CLASSIFICATION  OF  THIS  FA  CE 


REPORT  DOCUMENTATION  PAGE 

1 4  REPORT  SECURITY  CLASSIFICATION 

Unclassified 

1b  RESTRICTIVE  MARKINGS 

24  SECURITY  CLASSIFICATION  AUTHORITY 

3.  Distribution/availability  of  report 

Unclassified 

3b.  OECLASSIF  1  CATION/DOWNGRADING  SCHEDULE 

S.  MONITORING  ORGANIZATION  REFORT  NUMSERlS) 


I 


.4.  performing  organization  refort  numberisi 
ISI  No.  no  * 


St.  NAME  OF  PERFORMING  ORGANIZATION  Be.  OFFICE  SYMBOL  7t.  NAME  OF  MONITORING  ORGANIZATION 

I  (Ifapp  lico  bit  I 


INTECRATED  SYSTEMS,  INC. 


6c.  AOORESS  (City,  Stilt  and  ZIP  Cod*) 

5,500  NV\ss'o^  Co\\«y.  • 

S^\o,  C\oy*,  CP\  ^So5V 


AF0SR-TR.  88-  0  848 


74.  NAME  OF  MONITORING  ORGANIZATION 

Arosf5- 


7e.  ADORESS  (City,  Slot*  and  ZIP  Codt) 

ei  &  S  8-c* 


-fos 


9.  PROCUREMENT  INSTRUMENT  IDENTIFICATION  NUMBER 

F49620-85  OW*  O  OQc^ 


10  SOURCE  OF  FUNDING  NOS. 


11.  TITLE  ( Includ t  Stcunty  CloasifitationJ 

Adaptive  Control  Techniques  for  Large  Space  St 


13.  PERSONAL  AUTHORIS) 

Robert  L.  Kosut,  Michael  G.  Lyons 


13a  TYPE  OF  REPORT  I  13e  TIME  COVERED  14.  DATE  OF  REPORT  (Yr..  Mo..  Day) 

Annual  Technical  Reptj  from  i  l_8t_  to  S  >3.\  8*1 


WORK  UNIT 
NO. 


COSAT  I  COOES 


IELD  group  sub  gr 


18.  SUBJECT  TERMS  (Continue  on  revert?  if  neceeeory  end  identify  by  block  number} 


ABSTRACT  (Continue  On  reveree  if  neceeeery  end  identify  by  block  number / 

his  report  summarizes  the  research  performed  during  the  period  !  June  1985  to  31  Kay  1986 
on  adaptive  control  techniques  for  Large  Space  Structures  (LSS).  The  research  effort 
concentrated  on  two  areas:  (I)  on-line  robust  design  from  identified  models  -  what  is 
referred  to  here  as  adaptive  calibration;  and  (2)  an  analysis  of  slow-adaptation  for 
adaptive  control  of  LSSi  The  report  summarizes  the  results  obtained  in  these  areas  and 
also  includes  Appendices  which  contain  technical  articles  l  v O'  /V*  ^  ^  ,  e  Cr  v  .  1  ; 

!  ...  .  _  ", . --  .  ?  t-lfi  ,  ~  V-  <  ( C-l ,  *  .  ,,  /  U,<  r.  (  •  V  /"  J 1  , 


<lv  ft»  f  ’*  T",  .•  •  Or-  **  U-  U$.C  a_6. 

\  ~  A  -/  /\  J />  ,i  r  L-?1  r  •  Ct't  f i  <*  (  l?y  rT-e  pi*  ^ 


/ 

V  *  )  '  T 


/  (  ~  ■ 

,  0-t  ■  C*  ■  -  '•  f  '  1  •./  A  .  ■ ,  <f  •  ft  /-’  • 


20  0»STRIBUTI0N/AVAILAB»LITY  of  abstract 


}€'*?-  '  C  Cluj. 


L  *  {  t 


r>  9 

1  I. 

r‘  i  ■  '  '  t>  C 


21  ABSTRACT  SECURITY  CLASSIFICATION 


Iunclassifieo/unlimiteo  ^  same  as  rpt  (Biotic  users  1  Unclassified 


324  NAME  OF  RESPONSIBLE  INDIVIDUAL  >  >1  22b  TE  LEPHONE  NUMBE  R 


PD  FORM  1473.  83  APR 


EDITION  OF  1  JAN  73  IS  OBSOLETE. 


SECURITY  CLASSIFICATION  OF  THIS  PAGE 


Contents 


1  Introduction 

1.1  Background  and  Motivation 

1.2  Research  Objectives  .  .  .  . 


]  Accession  Fop 

NTIS  GRA&I 
DTIC  TAB 
Unannounced 
Justifloatic 

6^ 

□ 

in 

Rv 

Distribution/ 

Availability  Codes 

Dist 

6± 

Avail 

Spec 

and/or 

lal 

1.3  Current  Status . 

1.4  Selected  Publications  to  Date  .  .  . 

1.4.1  Journals  and  Conferences  . 

1.4.2  Books . 

1.4.3  Other  Related  Publications 

1.5  Collaborative  Research  Effort  .  .  . 

1.6  Future  Directions . 

1.7  Organization  of  Report . 


1 

1 

2 

2 

3 

3 

4 
4 

4 

5 
5 


2  Overview  of  Research  Activities  7 

2.1  Motivation  and  Objectives .  7 

2.2  Summary  of  Recent  Research .  8 

2.2.1  Stability  and  Convergence  Analysis .  8 

f  2.2.2  Uncertainty  Estimation .  10 

2.3  New  Research  Directions .  12 

2.4  Model  Structure  Selection  for  Control  Design .  13 

2.4.1  Model  Parametrization .  13 

2.4.2  Model  Evaluation .  13 

2.4.3  Use  of  the  MSE  for  Analysis  .  15 

2.4.4  Experiment  Design .  15 

2.4.5  Control  Design  Criteria  and  Approaches .  16 

2.5  Computing  the  Model  Estimate  from  Data  .  17 

2.5.1  Adaptive  Algorithms .  18 


CO 


2.5.2  Performance  of  the  Algorithm .  19 

2.5.3  Multiple  Model  Adaptive  Control .  19 

2.6  Computing  Model  Uncertainty  from  Data . 20 

2.7  Methods  of  Stability  and  Convergence  Analysis . 21 

2.7.1  A  Bit  of  History . 21 

2.7.2  New  Research  Directions  .  22 

3  Identification  for  Control  Design  25 

3.1  Introduction .  25 

3.2  “True”  System  vs.  Model  Set . 25 

3.3  Conditions  for  Closed-Loop  Stability  Robustness . 26 

3.4  Review  of  lYansfer  Function  Estimation .  27 

3.4.1  Sampled-data  Representation  and  Data  Structure . 28 

3.4.2  Prediction  Error  Methods  of  Parameter  Estimation . 28 

3.4.3  Least-Squares  Criterion .  29 

3.4.4  Least-Peak  Criterion . 29 

3.4.5  Computing  Model  Error:  The  Mean-Square-Error  (MSE) . 29 

3.4.6  Non-Parametric  Methods  of  Transfer  Function  Estimation . 30 

3.5  Application  to  LSS . 31 

3.5.1  Finite  Element  Modeling  . 31 

3.5.2  Continuum  Modeling . 32 

3.5.3  Spatial  Measurements . 32 

3.6  Uncertainty  Estimation . 32 

3.7  Computing  the  Estimate . 33 

3.7.1  Gradient  Algorithms . 34 

3.7.2  Gauss-Newton  Algorithm . 34 

3.7.3  Constructing  the  Error  Gradient  from  Measured  Signals . 35 

3.7.4  Algorithm  Modifications . 35 

3.7.5  Performance  of  the  Algorithm . 36 

3.8  Averaging  Analysis  of  Stability  and  Convergence . 36 

3.8.1  Method  of  Averaging . 36 

3.8.2  Asymptotic  Analysis  . 37 

3.8.3  Transient  Analysis . 37 

3.9  References . 39 

(u-) 


A  Reprints  of  Selected  Publications 


43 


List  of  Figures 


2.1  Two- Level  Control  Architecture .  8 

2.2  HAC/L AC  Performance  vs.  Structural  Parameter  Variation .  9 

2.3  Adaptive  Control .  10 

2.4  Adaptive  Control  with  AI  Outer- Loop .  11 

2.5  Multiple  Model  Adaptive  Control . 20 

3.1  A  generic  parametric  model  structure  . 33 


Chapter  1 

•  .1  >■ 

Introduction 


1.1  Background  and  Motivation 


The  Large  Space  Structure  (LSS)  research  program  was  originally  formulated  in  late  1982  in  re¬ 
sponse  to  the  increasing  concern  that  performance  robustness  of  Air  Force  LSS  type  system  would 
be  inadequate  to  meet  mission  objectives.  In  particular,  uncertainties  in  both  system  dynamics  and 
disturbance  spectra  characterizations  (both  time  varying  and  stochastic  uncertainty)  significantly 
limit  the  performance  attainable  with  fixed  gain,  fixed  architecture  controls.  Therefore,  the  use 
of  an  adaptive  system,  where  disturbances  and/or  plant  models  are  identified  prior  to  or  during 
control,  gives  systems  designers  more  options  for  minimizing  the  risk  in  achieving  performance 
objectives. 

The  aim  of  adaptive  control  i6  to  implement  in  real-time  and  on-line  as  many  as  possible  of  the 
design  functions  now  performed  off-line  by  the  control  engineer;  to  give  the  controller  “intelligence”. 
To  realize  this  aim,  both  a  theory  of  stability  and  performance  of  such  inherently  nonlinear  controls 
is  essential  as  well  as  a  technology  capable  of  achieving  the  implementation. 

The  issues  of  performance  sensitivity,  robustness,  and  achievement  of  very  high  performance 
in  an  LSS  system  can  be  effectively  addressed  using  adaptive  algorithms.  The  need  to  identify 
modal  frequencies,  for  example,  in  high-performance  disturbance  rejection  systems  has  been  shown 
in  ACOSS  (1981)  and  VCOSS  (1982).  The  deployment  of  high-performance  optical  or  RF  systems 
may  require  on-line  identification  of  critical  modal  parameters  before  full  control  authority  can  be 
exercised.  Parameter  sensitivity,  manifested  by  performance  degradation  or  loss  of  stability  (poor 
robustness)  may  be  effectively  reduced  by  adaptive  feedback  mechanizations.  Reducing  the  effects  of 
on-board  disturbance  rejection)  is  particularly  important  for  planned  Air  Force  missions.  For  these 
cases,  adaptive  control  mechanizations  are  needed  to  produce  the  three-to-five  orders-of-magnitude 
reductions  in  line-of-sight  jitter  required  by  the  mission. 

Research  is  essential  to  identify  the  performance  limitations  of  adaptive  strategies  for  LSS  con¬ 
trol  both  from  theoretical  and  hardware  mechanization  viewpoints.  The  long  range  goal  of  this 
research  program  is  to  establish  guidelines  for  selecting  the  appropriate  strategy,  to  evaluate  per¬ 
formance  improvements  over  fixed-gain  mechanizations,  and  to  examine  the  architecture  necessary 
to  produce  a  practical  hardware,,  realization.  The  initial  thru6t,  however,  is  to  continue  to  build  a 
strong  theoretical  foundation  without  losing  sight  of  the  practical  implementation  issues. 
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1.2  Research  Objectives 

•  .1  * 

The  aims  of  this  research  study  are  to  extend  and  develop  adaptive  control  theory  and  its  application 
to  LSS  in  several  directions.  These  include: 

1.  Theoretical  Development:  The  initial  emphasis  has  been  on  slow  adaptation,  since  this 
covers  may  LSS  situations.  Later  on  we  will  examine  fast  adaptation.  The  theory  developed 
here  will  provide  for: 

(a)  estimates  of  robustness,  i.e.,  stability  margins  vs.  performance  bounds; 

(b)  estimates  of  regions  of  attraction  and  rates  of  parameter  convergence  to  these  regions; 

(c)  extension  of  the  present  linear  finite  dimensional  adaptive  theory  to  include  nonlinear 
and  infinite  dimensional  plants  and  controller  structures;  and 

(d)  extensions  to  decentralized  systems. 

2.  Parameter  Adaptive  Algorithms  Assess  the  behavior  of  different  algorithms,  including: 
gradient,  recursive  least  squares,  normalized  least  mean  squares,  and  nonlinear  observer  (e.g, 
Extended  Kalman  Filter). 

3.  Parametric  Models:  Assess  the  impact  of  model  choices.  In  particular  we  will  examine 
the  effect  of  explicit  and  implicit  model  choices.  An  explicit  model,  for  example,  is  a  transfer 
function  whose  coefficients  are  all  unknown.  In  an  implicit  model  transfer  function,  the 
coefficients  would  be  functions  of  some  other  parameters.  Implicit  models  usually  arise  from 
physical  or  experimental  data,  whereas  explicit  models  are  selected  for  analytical  convenience. 

4.  Adaptive  Nonlinear  Control:  Although  our  early  effort  is  to  study  adaptive  linear  control, 
there  are  may  LSS  situations  where  the  control  is  nonlinear,  e.g.,  large  angle  maneuvers, 
slewing. 

1.3  Current  Status 

At  the  present  time  we  stand  at  the  beginning  stages  of  the  theoretical  development  in  adaptive 
control.  The  result  of  recent  efforts  are  contained  in  the  selected  papers  in  the  Appendix  and  the 
references  therein.  A  summary  of  earlier  efforts  is  contained  in  the  recently  published  textbook 
Stability  of  Adaptive  Systems:  Passivity  and  Averaging  Analysis,  MIT  Press,  1986.  This  publica¬ 
tion  is  an  outgrowth  of  research  supported  under  this  contract  and  involved  a  considerable  amount 
of  collaborative  effort  among  several  researchers  in  the  field  of  adaptive  control.  The  text  discusses 
adaptive  systems  from  the  viewpoint  of  stability  theory.  The  emphasis  is  on  methodology  and  basic 
concepts,  rather  than  on  details  of  adaptive  algorithm.  The  analysis  reveals  common  properties 
including  causes  and 'mechanisms  for  instability  and  the  means  to  counteract  them.  Conditions 
for  stability  are  presented  under  slow  adaptation,  where  the  method  of  averaging  is  utilized.  In 
this  latter  case  the  stability  result  is  local,  i.e.,  the  initial  parametrization  and  input  spectrum  is 
constrained.  Based  on  this  analysis,  a  conceptual  framework  is  now  available  to  pursue  the  issues 
of  slow  adaptive  control  of  LSS. 

To  remove  the  restrictiveness  of  slow  adaptation  requires  an  understanding  of  the  transient 
behavior  of  adaptive  systems.  A  preliminary  investigation  is  reported  in  Kosut  et  al  (1986).  The 
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transient  behavior  of  not-slow  or  even  rapid  adaptation  is  a  significant  problem  in  the  adaptive 
control  of  LSS,  e.g.,  rapid  retargeting. 

Another  approach  to  adaptive  control  is  to  calibrate  (or  tune)  the  controller  based  on  a  current 
estimate  of  the  LSS  model.  This  involves  not  just  knowing  one  model,  but  rather,  a  model  set.  This 
problem,  which  we  refer  to  as  adaptive  calibration,  is  essentially  that  of  developing  a  technique  of 
on-line  robust  control  design  from  an  identified  model.  Although  we  have  worked  on  this  problem 
fcr  some  time  it  is  only  recently  that  we  have  established  a  theoretical  basis  for  estimating  model 
error  from  system  identification  [see  Kosut  (1987),  a  reprint  is  in  the  Appendix].  This  research 
has  raised  many  new  questions  which  need  to  be  considered,  e.g.,  what  is  the  appropriate  robust 
controller  parametrization;  how  does  it  relate  to  model  parametrization;  how  to  iterate  on  the  data 
if  the  estimate  of  model  error  is  too  large;  what  are  the  heuristics  for  experiment  design. 


1.4  Selected  Publications  to  Date 

1.4.1  Journals  and  Conferences 

R.L.  Kosut,  “On  The  Use  of  The  Method  of  Averaging  for  the  Stability  Analysis  of  Adaptive 
Linear  Control  Systems”,  Proc.  IEEE  CDC,  Los  Angeles,  CA,  Dec.  1987. 

R.L.  Kosut,  “Conditions  for  Convergence  and  Divergence  of  Parameter  Adaptive  Linear  Systems”, 
Proc.  ISCAS  198 7,  Philda.,  PA,  May,  1987. 

R.L.  Kosut,  “Adaptive  Calibration:  An  approach  to  Uncertainty  Modeling  and  On-Line  Robust 
Control  Design”,  Proc.  25th  IEEE  CDC,  Athens,  Greece,  Dec.  1986. 

R.L.  Kosut,  “Towards  an  On-Line  Procedure  for  Automated  Robust  Control  Design:  The  Adaptive 
Calibration  Problem”,  presented  at  1986  ACC,  June  1986. 

R.L.  Kosut,1  I.M.Y.  Mareels,  B.D.O.  Anderson,  R.R.  Bitmead,  and  C.R.  Johnson,  Jr.,  “Transient 
Analysis  of  Adaptive  Control”,  submitted  to  IFAC  10th  World  Congress,  Munich,  Germany, 
July  1987. 

R.L.  Kosut,1  and  R.R.  Bitmead,  “Fixed-Point  Theorems  for  Stability  Analysis  of  Adaptive  Sys¬ 
tems”,  Proc.  IFAC  Workshop  on  Adaptive  Systems,  Lund,  Sweden,  July  1986. 

R.L.  Kosut,1  and  R.R.  Bitmead,  “Linearization  of  Adaptive  Systems:  A  Fixed-Point  Analy¬ 
sis”,  submitted,  IEEE  Trans,  on  Circuits  and  Systems;  Special  Issue  on  Adaptive  Systems, 
Sept.  1987. 

I.M.Y.  Mareek,  R.R.  Bitmead,  M.  Gevers,  C.R.  Johnson,  Jr.,  R.L.  Kosut,1  and  M.A.  Poubelle, 
“How  Exciting  Can  a  Signal  Really  Be?”,  to  appear,  Systems  and  Control  Letters. 

R.L.  Kosut,1  B.D.O.  Anderson,  and  I.M.Y.  Mareels,  “Stability  Theory  for  Adaptive  Systems: 
Methods  of  Averaging  and  Persistency  of  Excitation”,  IEEE  Trans,  on  Aut.  Contr.,  to 
appear,  Jan.  1987. 

B.D.O.  Anderson,  R.R.  Bitmead,  C.R.  Johnson,  Jr.,  and  R.L.  Kosut,  “Stability  Theorems  for  the 
Relaxation  of  the  SPR  Condition  in  Hyperstable  Adaptive  Systems”,  IEEE  Trans,  on  Aut. 
Contr.,  submitted. 

1  Research  performed  while  R.L.  Koeut  was  a  Visiting  Fellow  at  the  Australian  National  University. 
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R.L.  Kosut  and  C.R.  Johnson,  Jr.,  “An  Input-Output  View  of  Robustness  in  Adaptive  Control”, 
Automatica:  Special  Issue  on  Adaptive  Control,  20(5):569-581,  Sept.  1984. 

R.L.  Kosut,2  and  B.  Friedlander,  “Robust  Adaptive  Control:  Conditions  for  Global  Stability”, 
IEEE  Hans,  on  Aut.  Contr.,  AC-30(7):610-624,  July  1985. 


1.4.2  Books 

B.D.O.  Anderson,  R.R.  Bitmead,  C.R.  Johnson,  Jr.,  P.V.  Kokotovic,  R.L.  Kosut,  I.M.Y.  Mareels, 
L.  Praly,  and  B.D.  Riedle,  Stability  of  Adaptive  Systems:  Passivity  and  Averaging  Analysis, 
MIT  Press,  1986. 

R.L.  Kosut,  “Methods  of  Averaging  for  Adaptive  Systems”,  Adaptive  Systems:  Theory  and  Ap¬ 
plications,  Editor:  K.S.  Narendra,  Plenum  Press,  1986. 

R.L.  Kosut  and  M.G.  Lyons,  “Issues  in  Control  Design  for  Large  Space  Structures”,  Adaptive 
Systems:  Theory  and  Applications,  Editor:  K.S.  Narendra,  Plenum  Press,  1986. 

R.L.  Kosut,  “Adaptive  Control  of  Large  Space  Structures:  Uncertainty  Estimation  and  Robust 
Control  Calibration”,  Large  Space  Structures:  Dynamics  and  Control ,  Editors:  S.N.  Atluri 
and  A.K.  Amos,  Springer- Verlag,  1987. 

1.4.3  Other  Related  Publications 

M.L.  Workman,  R.L.  Kosut,3  and  Franklin,  “Adaptive  Proximate  Time-Optimal  Servomecha¬ 
nisms:  Continuous-Time  Case”,  Proc.  ACC,  pp.589-594,  June  1987,  Minneapolis,  MN. 

M.L.  Workman,  R.L.  Kosut3,  and  Franklin,  “Adaptive  Proximate  Time-Optimal  Servomecha¬ 
nisms:  Discrete-Time  CJas^”,  Proc.  CDC,  Dec.  1987,  Los  Angeles,  CA. 

R. L.  Kosut,4  A.  Pascoal,  S.  Morrison,  and  M.L.  Workman,  “Time-Optimal  Control  of  Large  Space 

Structures”,  Proc.  SPIE,  Jan.  1988,  Los  Angeles,  CA. 

S.  Philips,  R.L.  Kosut3,  and  G.F.  Franklin,  “An  Averaging  Analysis  of  Discrete- Time  Indirect 

Adaptive  Control”,  to  appear,  Proc.  ACC,  June  1988,  Atlanta,  GA. 


1.5  Collaborative  Research  Effort 

Dr.  Kosut  has  continual  exchanges  and  collaboration  with  many  of  the  leading  researchers  in  the 
field  of  adaptive  control  and  system  identification.  He  has  made  two  visits  to  the  Australian 
National  University  in  order  to  work  jointly  with  Prof.  B.D.O.  Anderson  and  Prof.  R.R.  Bitmead 
of  the  Systems  Engineering  Dept.  These  visits  were  supported  by  AFOSR  under  this  contract 
with  travel  grants  from  the  Australian  National  University  and  a  joint  research  grant  with  ISI  and 
Cornell  University  (Prof.  C.R.  Johnson,  Jr.  was  co-principal  investigator)  under  funding  from  the 

*Started  under  contract  F4920-81-C-0051. 

a  Research  supported  partly  by  NSF  Industry/  University  Cooperative  Research  Program  under  Grant  ECS- 
8605646. 

'Resarch  supported  by  SDIO/IST  and  managed  by  AFOSR. 
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NSF/US  Australia  Cooperative  Research  Program.  Dr.  Kosut  also  visited  on  several  occasions  with 
Prof.  Kokotovic  of  the  Coordinated  Science  Lab  of  the  University  of  Illinois  in  Urbana,  Illinois, 
and  with  Prof.  C.R.  Johnson,  Jr.  of  the  School  of  Electrical  Engineering  of  Cornell  University  in 
Ithaca,  New  York.  The  textbook  Stability  of  Adaptive  Systems  (MIT  Press,  1986)  and  numerous 
joint  papers  (see  Section  1.4)  were  a  direct  outcome  of  these  visits. 

Dr.  Kosut  has  also  been  invited  to  spend  some  time  in  Sweden  with  Prof.  K.J.  Astrom  of  the 
Lund  Institute  of  technology  and  with  Prof.  L.  Ljung  of  Linkoping  University,  both  of  whom  we 
experts  in  the  field  of  system  identification  and  adaptive  control.  These  visits  are  supported  by 
AFOSR,  Directorate  of  Aerospace  Sciences,  the  NSF  Industry/University  Cooperative  Research 
Program,  and  the  NSF  U.S. /Sweden  Cooperative  Research  Program. 

Prof.  E.  Crawley  from  the  Dept,  of  Aeronautics  and  Astronautics  of  MIT  will  be  spending  his 
sabbatical  leave,  starting  in  December  1987,  at  Stanford  University.  There  he  will  collaborate  with 
Dr.  Kosut  on  a  textbook  on  the  dynamics  and  control  of  large  space  structures. 

Dr.  Kosut  is  also  collaborating  with  Professor  C.  R.  Johnson,  Jr.  from  Cornell  University  on  a 
textbook  on  robust  and  adaptive  control. 


1.6  Future  Directions 

Based  on  our  recent  results  as  reported  here,  we  envision  near-term  activity  in  several  directions, 
including:  ,  .  .  . 

•  transient  analysis  of  adaptive  control; 

•  analysis  of  adaptive  calibration; 

•  decentralized  control  structures; 

•  effect  of  nonlinear  and  infinite  dimensional  phenomena; 

•  effect  of  different  algorithms  and  parametrizations. 

1.7  Organization  of  Report 

In  Chapter  2  we  provide  an  overview  of  our  rerseach  efforts  in  this  past  year.  Specifically,  the 
chapter  contains  brief  discussions  of  some  of  the  research  areas  listed  above,  pointing  out  the 
general  research  directions,  and  in  particular,  their  specific  applicability  to  LSS  systems. 

Chapter  3  contains  a  parallel,  but  more  in-depth  technical  discussion  of  the  same  research  topics. 

There  is  also  an  Appendix  which  contains  6ome  of  our  recent  publications  from  this  past  year. 


Chapter  2 


Overview  df  Research  Activities 

2.1  Motivation  and  Objectives 


Many  of  the  envisioned  future  large  space  structure  (LSS)  missions  will  impose  stringent  perfor¬ 
mance  demands  on  tracking  accuracy  and  structural  vibration  attenuation.  For  example,  some 
planned  missions  will  require  three  to  five  orders  of  magnitude  reduction  in  line-of-sight  optical 
jitter.  Both  active  feedback  control  and  passive  damping  will  thus  be  a  practical  necessity,  and 
moreover,  their  design  will  require  a  model  of  the  LSS  system  whose  accuracy  is  compatible  with 
the  performance  demands.  Even  random  variations  in  materials  and  manufacturing  tolerances 
will  significantly  degrade  closed-loop  performance.  For  example,  the  need  to  have  very  accurate 
models  for  high  performance  disturbance  rejection  has  been  demonstrated  in  the  ACOSS(1981) 
and  VCOSS(1982)  programs.  The  deployment  ofhigh-performance  optical  or  RF  systems  will  also 
require  accurate  models  before  full  control  authority  can  be  exercised.  As  a  result  of  these  and 
other  research  programs,  it  is  clear  that  the  on-orbit  dynamics  of  LSS  will  not  be  sufficiently  like 
those  obtained  from  either  ground-testing  or  even  from  sophisticated  computer  generated  modeling 
techniques,  such  as  finite  element  modeling.  Current  structural  modeling  techniques  are  just  not 
sufficiently  accurate  or  able  to  account  for  all  the  possible  sources  of  parameter  variation.  There¬ 
fore,  under  these  conditions,  it  will  be  necessary  to  identify  the  LSS  dynamics  directly  from  on-orbit 
mtasurements,  and  simultaneously,  tune  ^r  re-design  the  control.  Hence,  the  control  design  cycle 
will  be  an  adaptive  process,  typically  starting  with  a  nominal  low-performance  design  based  on 
a  coarse  model,  and  then  re-designed  from  on-orbit  data.  The  adaptation  may  take  place  either 
off-line  or  simultaneously  while  the  system  is  operating,  but  in  either  case,  the  process  will  require 
on-orbit  data. 

As  an  illustration  of  adaptive  control  for  an  LSS  system,  consider  the  two-level  control  archi¬ 
tecture  depicted  in  Figure  2.1.  The  two  levels  consist  of  a  colocated  rate  damping  controller  and 
a  non-colocated  high  performance  controller.  The  colocated  controller  consists  of  active  and/or 
passive  rate  damping  devices  placed  at  critical  structural  locations,  and  their  design  requires  only  a 
coarse  knowledge  of  system  dynamics.  This  is  an  inherently  robust  controller,  but  yields  low  perfor¬ 
mance,  and  has  been  referred  to  as  the  low  authority  control  (LAC)  system.  The  high  performance 
controller  is  non-colocated  and  requires  an  accurate  knowlege  of  critical  modes,  and  hence,  is  very 
sensitive  to  structural  parameter  variations.  This  controller,  referred  to  as  the  high  authority  con¬ 
trol  (HAC)  system,  provides  high  damping  and  mode  shape  adjustment  in  selected  modes  in  order 
to  meet  the  performance  demands.  We  remark  that  the  stringent  performance  demands  cannot  be 
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Figure  2.1:  Two-Level  Control  Architecture 


|  met  with  passive  or  active  rate  damping  mechanisms  alone.  Also,  the  synthesis  and  design  of  the 

H AC/LAC  system  has  to  be  properly  integrated  to  avoid  performance  degradation  due  to  modeling 
inaccuracies,  see,  e.g.,  Aubrun  et  al.(1979).  With  this  architecture  only  the  HAC-system  is  likely 
to  be  tuned  by  an  adaptive  system,  as  shown  in  Figure  2.1. 

The  limitations  of  the  HAC  and  LAC  systems  performance  with  respect  to  structural  parameter 
I  variations  is  illustrated  in  Figure  2.2.  Observe  that  the  LAC  system  provides  significant  robustness 

to  parameter  variations,  but  the  performance  increase  over  the  open  loop  structure  is  moderate. 
The  opposite  is  true  for  the  HAC  system,  which  provides  significant  performance  increase,  but  is 
very  sensitive  to  parameter  variations.  The  adaptive  control  of  the  HAC  system  allows  for  a  much 
wider  latitude  in  parameter  variation  while  maintaining  the  performance  level  required. 

The  objective  of  our  research  program  is  to  establish  the  theoretical  foundations  and  performance 
limitations  for  the  application  of  adaptive  control  to  large  space  structures.  To  realize  ' his  aim,  the¬ 
ories  of  both  stability  and  performance  of  such  an  inherently  nonlinear  control  system  are  essential, 
as  well  as  a  technology  capable  of  achieving  the  implementation. 


2.2  Summary  of  Recent  Research 

.  .i 

2.2.1  Stability  and  Convergence  Analysis 

Until  recently,  the  underlying  conditions  for  stability  and  convergence  and  the  causes  for  divergence 
and  instability  in  adaptive  control  systems  have  not  been  entirely  understood.  We  our  proud  to 
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Figure  2.2:  HAC/LAC  Performance  vs.  Structural  Parameter  Variation 

6ay  that  our  recent  research,  supported  by  the  AFOSR  Directorate  of  Aerospace  Sciences,  has 
contributed  significantly  to  a  deeper  understanding  of  these  phenomena.  In  fact,  when  we  began 
our  research  program,  the  then  existing  theory  of  adaptive  control  was  wholly  inapplicable,  not 
only  for  LSS  systems,  but  for  any  dynamic  system.  The  main  reason  for  this  was  that  the  then 
existing  theory  required  perfectly  accurate  models,  which  of  course  is  an  unrealistic  requirement. 

As  a  consequence  of  this  lack  of  appropriate  theory,  a  major  emphasis  in  our  recent  research  has 
been  the  development  of  a  fundamental  theory  of  adaptive  control  which  is  applicable  to  realistic 
dynamic  system  models. 

In  commencing  our  research  program,  we  first  recognized  that  the  problems  in  establishing 
realistic  conditions  for  stability  and  convergence  arise  not  only  from  the  unique  dynamical  character 
of  LSS  systems,  but  also  from  the  basic  philosophy  underlying  the  design  of  an  adaptive  control 
system.  As  depicted  in  Figure  2.3,  there  are  essentially  two  basic  processes,  namely:  (1)  a  model 
estimator,  and  (2)  a  control  design  rule. 

The  model  estimator  operates  on  the  input-output  data  obtained  from  measurements  of  the 
system  to  be  controlled,  producing  a  model  estimate  from  a  pre-determined  model  set.  The  model 
estimate  is  transformed  by  the  control  design  rule  into  a  controller  to  be  used  in  feedback  with  the 
actual  system.  The  fundamental  question  is:  when  will  it  work?  That  is,  under  what  conditions 
will  the  model  estimate  converge  to  a  good  model  in  the  allowed  model  set.  By  a  good  model 
is  meant  one  that  produces  a  controller,  via  the  control  design  rule,  which  when  applied  to  the 
actual  system  yields  acceptable  closed-loop  performance.  Even  if  such  a  good  model  exists,  the 
estimated  model  may  not  converge,  even  if  the  estimated  model  is  initialized  close  to  a  good  model. 
Moreover,  if  convergence  is  too  slow,  then  unacceptable  behavior  can  occur  during  the  learning 
process.  Thus,  convergence  alone  is  not  a  sufficient  condition  for  establishing  good  performance. 
We  also  need  information  about  the  convergence  rate,  particularly  how  it  can  be  affected  by  user 
choices  of  inputs,  data  filters,  etc. 

During  our  research  on  the  problem  of  stability  and  convergence  we  worked  closely  with  several 
other  researchers.  A  summary  of  these  efforts  is  contained  in  the  recently  published  textbook  Sta- 
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Figure  2.3:  Adaptive  Control 


bility  of  Adaptive  Systems:  Passivity  and  Averaging  Analysis  (MIT  Press)  by  Anderson  et  al.(  1986). 
The  stability  analysis  focuses  on  6low  adaptation  and  the  use  of  some  of  the  classical  methods  for 
analyzing  differentia]  equations,  e.g.,  linearization,  the  method  of  averaging,  and  Lyapunov's  second 
method,  see,  e.g.,  Hale(1969).  The  material  in  the  text  represents  modifications  and  refinements  of 
earlier  work,  specifically:  Astrom  (1983, 1984)  showing  how  the  method  of  averaging  explains  insta¬ 
bilities  and  drift;  Reidle  and  Kokotovic  (1985, 1986)  on  slow  adaptation  and  the  integral  manifold; 
Kosut,  Anderson,  and  Mareels(1987)  on  the  relation  between  averaging  and  persistent  excitation; 
Kosut  (1985)  and  Bodson  et  al.(1986)  on  nonlinear  averaging  analysis  and  determining  the  rate  of 
convergence;  and  Kosut  and  Anderson  (1986),  Kosut  and  Johnson(1984)  on  linearization  and  local 
stability. 

The  above  analysis  reveals  common  properties  including  the  causes  and  mechanisms  for  insta¬ 
bilities  and  the  meads  to  counteract  them.  In  particular,  conditions  for  stability  axe  presented 
under  slow  adaptation,  involving  the  method  of  averaging.  Based  on  this  analysis,  a  conceptual 
framework  is  now  available,  for  the  first  time,  to  pursue  adaptive  control  of  LSS  systems,  under 
slow  adaptation.  We  remark  that  in  the  literature  on  adaptive  control,  this  issue  is  often  referred 
to  as  the  robustness  of  adaptive  control,  i.e.,  what  happens  under  non-ideal  conditions  such  as  the 
effect  of  unmodeled  dynamics  and  disturbances.  [  An  historical  perspective  of  this  research  area  is 
provided  in  Section  2.7.1]. 

To  remove  the  restrictiveness  of  slow  adaptation  requires  an  understanding  of  the  transient 
behavior  of  adaptive  systems.  Preliminary  investigations  are  reported  in  Kosut  and  Bitmead(1986) 
and  Kosut  et  al.(1987).  This  pursuit  involves  new  analysis  methods,  not  necessarily  averaging,  and 
further  development  of  these  and  other  techniques  is  one  of  the  our  research  goals. 


2.2.2  Uncertainty  Estimation 

Referring  again  to  Figure  2.3,  we  see  from  the  previous  discussion  that  the  adaptive  control  system 
ib  working  if  one  can  prove  that  the  estimated  model  converges  to  a  “good”  model  of  the  true 


Figure  2.4:  Adaptive  Control  with  AI  Outer-Loop 


system.  And,  for  example,  analysis  techniques  based  on  the  method  of  averaging,  as  described 
above,  do  provide  an  assessment  of  the  adaptive  system  under  slow  adaptation. 

Another  route,  which  we  are  currently  exploring,  is  to  modify  the  identification  process  so  that 
it  produces  not  only  an  estimated  model,  but  a  measure  of  model  uncertainty  as  well.  With  such 
an  estimate  in  hand  the  control  design  rule  can  be  modified  to  account  for  the  model  uncertainty. 
If  the  resulting  controller  is  too  cautious,  because  the  model  estimate  is  too  uncertain,  then  it  is 
necessary  to  repeat  the  estimation  procedure  so  as  to  obtain  a  better  model. 

Thus,  the  adaptive  control  now  contains  an  outer  loop,  as  shown  in  Figure  2.4  which  provides 
some  expert  advice  as  to  when  to  use  the  model  estimate.  Other  “artificial  intelligence”  (Al) 
features  can  also  be  incorporated  in  the  outer  loop.  This  feature  will  be  expanded  upon  in  our  new 
program. 

In  our  current  research  program  we  are  concentrating  on  a  particular  procedure  for  obtaining 
a  measure  of  model  uncertainty  for  linear  time  invariant  plants.  The  process  is  referred  to  as  un¬ 
certainty  estimation  and  the  outcome  is  a  frequency  domain  expression  for  the  model  uncertainty. 
The  motivation  arises  from  robust  control  theory  which  utilizes  frequency  domain  expressions  for 
characterizing  a  set  of  uncertainty  within  which  lies  the  true  plant,  e.g.,  Francis  and  Zames(1983), 
Safonov  et  al. (1981),  Doyle  and  Chu(1985),  Vidyasagar( 1 985).  The  set  of  uncertainty  is  extracted 
from  input-output  measurements.  Our  current  approach  uses  a  combination  of  parametric  pre¬ 
diction  error  methods  together  with  standard  non-parametric  (spectral)  estimation  methods.  The 
set  of  uncertainty  produced  by  the  modified  identifier  allows  for  performance  evaluation  before  the 
controller  is  adjusted.  If  the  predicted  performance  is  not  satisfactory  then  the  identification  pro¬ 
cess  i6  repeated  under  different  conditions  which  will  reduce  model  uncertainty  where  needed.  The 
resulting  controller  is  designed  to  be  robust  with  respect  to  the  estimated  set  of  uncertainty.  Thus, 
a  large  set  of  uncertainty  requires  a  cautious  or  low  authority  controller,  whereas  a  small  set  of 
uncertainty  will  result  in  a  high  authority  controller.  Also,  the  order  of  the  controller  can  vary  with 
the  set  of  uncertainty.  Details  have  been  reported  in  Kosut(1986, 1987a, b). 
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2.3  New  Research  Directions 


The  previous  discussion  emphasized  two  important  areas  of  basic  research,  namely:  (i)  methods 
of  stability  and  convergence  analysis,  and  (ii)  uncertainty  estimation.  In  order  to  facilitate  our 
research  in  these  areas  we  have  concentrated  on  the  simplest  of  model  parametrizations,  namely, 
finite  dimensional  transfer  functions  where  the  adaptive  parameters  are  the  coefficients.  This  choice 
leads  to  the  standard  forms  of  recursive  parameter  adaptive  algorithms  for  identification  and  adap¬ 
tive  control,  e.g.,  gradient,  recursive-least-squares,  etc.  Despite  this  simple  model  choice,  we  are 
nonetheless  still  faced  with  basic  unresolved  research  issues  in  both  of  the  mentioned  areas.  At 
the  same  time,  we  feel  that  there  is  now  a  sufficient  foundation  to  extend  the  previous  results  and 
develop  new  theory  and  methodology  applicable  to  other  model  parametrizations  and  other  types  of 
controllers.  For  example,  physical  model  parametrizations,  typical  of  LSS  characteristics,  include: 
continuum  models,  wave  models,  and  finite  element  models.  These  types  of  models  will  require  new 
identification  and  adaptive  control  algorithms,  and  will  stretch  and  test  our  previously  developed 
methods  for  stability  and  convergence  analysis.  Other  control  types  include:  nonlinear  controllers 
such  as  might  arise  from  large  angle  maneuvers  or  rapid-retargetting  and  tracking,  and  decentral¬ 
ized  controllers  which  arise  from  restricted  information  distribution,  i.e.,  decentralized  information 
patterns. 

«  .1  ' 

Many  new  unanswered  questions  arise  from  these  choices,  specifically  how  they  affect  system 
identification  and  adaptive  control.  For  example: 

•  What  is  the  best  procedure  for  developing  parameter  estimation  algorithms  from  physical 
model  parametrizations,  e.g.,  gradient,  Gauss-Newton,  etc? 

•  How  do  we  use  an  estimated  physical  model  for  control  design? 

•  Will  our  present  analysis  tools  for  stability  and  convergence  be  valid  for  these  model  choices? 
What  are  the  necessary  modifications? 

•  How  well  do  such  model  forms  approximate  the  "true”  LSS  system?  How  well  do  they 
have  to,  particularly  when  the  intended  use  is  control  design?  What  is  the  effect  on  model 
structure  selection  of  the  control  design  criteria,  such  as  vibration  suppression,  rapid  tracking, 
decentralized  information,  etc? 

•  What  are  the  best  test  conditions  for  the  identification  experiment  when  the  intended  use 
of  the  estimated  model  is  for  control  design?  How  does  one  choose  the  experiment  design 
variables,  such  as  test  signals,  sampling  rate,  data  filters,  and  so  on? 

•  What  is  the  affect  of  utilizing  the  estimated  model  for  nonlinear  control? 

•  How  does  a  decentralized  control  architecture  modify  system  identification?  Under  what 
conditions  will  a  truly  decentralized  adaptive  control  system  converge? 

•  Given  any  model,  is  it  better,  in  general, to  directly  estimate  the  model  or  to  directly  adjust 
the  controller  parameters  based  on  the  model? 


These  questions  broadly  cover  the  basic  research  areas.  However,  it  is  often  difficult  to  discuss 
one  area  without  another. 
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2.4  Model  Structure  Selection  for  Control  Design 


Consider  again  the  adaptive  control  system  depicted  in  Figure  2.3.  The  controller  parameters  are 
adjusted  in  accordance  with  estimated  model  parameters  obtained  from  on-line  data.  The  first  step 
then,  in  designing  the  adaptive  control  system,  is  to  select  a  model  structure  or  parametrization.1 

2.4.1  Model  Parametrization 

We  will  restrict  our  discussion  here  to  parametric  transfer  function  models.  There  are  two  basic 
choices  for  parameters: 

•  physical  parameters  •  1  ' 

•  canonical  parameters 

Physical  parameters  include  masses,  spring  constants,  electrical  circuit  quantities,  etc.  Canoni¬ 
cal  parameters  are  best  exemplified  by  transfer  function  coefficients,  which  usually  depend  in  a 
complicated  manner  on  the  physical  parameters.  Moreover,  if  the  transfer  function  represents  a 
sampled- data  system,  then  the  relation  between  the  physical  parameters  and  the  transfer  function 
coefficients  becomes  even  more  complicated,  particularly  when  the  sampling  rate  is  low. 

In  parameter  estimation,  the  above  parametrizations  offer  different  adavantages  and  disadvan¬ 
tages.  The  estimation  of  transfer  function  coefficients  can  be  very  direct,  because  it  is  possible  to 
express  the  prediction  error  as  a  linear  function  of  these  parameters,  thus  leading  to  simple  estima¬ 
tion  algorithms,  such  as  recursive  least  squares.  Physical  parameters  are  directly  meaningful,  but 
because  they  enter  in  a  complicated  manner  into  the  transfer  function  coefficients,  the  prediction 
error  is  also  a  complicated  function  of  the  physical  parameters,  making  direct  estimation  schemes 
difficult  to  design.  Often  a  system  model  can  be  characterized  by  a  few  physical  parameters, 
whereas  the  transfer  function  may  consist  of  very  many  coefficients. 

One  can  now  ask  the  question:  which  of  the  above  model  parametrizations  is  “better”?  To 
answer  this  question,  we  first  remark  that  neither  is  correct,  because  the  true  system  is  different 
than  either  model.  Even  the  so  called  physical  transfer  function  model  is  undoubtedly  derived 
under  certain  assumptions,  such  as  small  deflections,  linear  elasticity,  uniform  mass  density,  etc. 
Hence,  the  choice  should  be  made  by  considering  the  intended  use  of  the  model. 

2.4.2  Model  Evaluation 

In  control  design,  which  is  the  case  we  consider  here,  what  is  needed  is  an  estimate  of  the  trans¬ 
fer  function,  whose  accuracy  is  determined  by  the  closed-loop  performance.  In  this  regard  the 
parametrization  is  a  convenient  way  to  obtain  this  result.  However,  the  different  parametrizations 
invoke  parameter  estimation  algorithms  which  may  have  significantly  different  convergence  prop¬ 
erties  and  numerical  stability  properites.  Hence,  although  closed-loop  performance  is  the  ultimate 
aim,  other  criteria  must  be  considered  such  as  numerin  its  complexity,  e.g.,  there  should  not  be  too 
many  parameters,  but  just  enough  for  the  intended  use  of  the  model. 

’It  is  possible  to  parametrize  the  closed-loop  system  in  terms  of  controller  parameters,  which  are  then  directly 
adjusted,  rather  than  going  through  an  indirect  route  via  model  parameter  estimation.  An  example  is  the  model 
reference  adaptive  control. 


13 


In  summary,  to  evaluate  a  particular  model  parametrization  for  on-line  parameter  estimation, 
when  the  intended  use  of  the  estimate  is  for  control  design,  the  analyst  needs  to  determine  the 
relations  among  the  following  choices: 

•  a  representative  “true”  system 

•  a  parametric  model  set*  ' 

•  a  model  estimator  or  adaptive  algorithm 

•  a  set  of  closed-loop  performance  criteria 

•  a  control  design  approach,  rule,  or  algorithm 

At  first  glance  it  would  appear  that  these  choices  cannot  be  separately  analyzed.  This  is  not 
the  case  at  all!  A  very  clear  presentation  of  parametric  transfer  function  estimation  is  contained  in 
the  recently  published  textbook  by  Ljung(1987).  [A  brief  summary  is  provided  in  the  Appendix  in 
Section  3.4.]  There  we  find  the  following  important  results: 

•  Least  squares  parameter  estimation  is  equivalent,  asymptotically  in  the  length  of  the  data 
record,  to  minimization  of  a  weighted  frequency  domain  criterion  which  penalizes  the  quadratic 
error  between  the  true  system  transfer  function  and  the  parametric  model  transfer  function. 
The  weighting  function  depends  on  various  user  choices,  Buch  as  the  input  spectrum,  data 
filters,  and  the  noise  or  disturbance  model. 

•  The  mean-square-error  (MSE)  between  the  true  system  transfer  function  and  the  ideal2  es¬ 
timated  transfer  function  can  be  explicitly  evaluated  and  depends  on  the  user  choices  listed 
above. 

One  of  the  most  significant  aspects  of  the  above  results  is  that  the  MSE  btween  a  particular 
choice  of  model  parametrization  and  the  true  system  can  be  evaluated  without  specifying  how  the 
estimate  is  computed  or  whether  it  converges.  The  convergence  question  involves  the  analysis  tech¬ 
niques  reported  in  Anderson  et  al.(1986).  Hence,  conditions  for  convergence  add  further  constraints 
on  the  user  choices.  It  is  important  to  emphasize  that  since  the  above  results  depend  on  the  choice 
of  a  “true”  representaive  system,  they  can  be  used  for  analysis  and  design  of  the  estimator  and 
estimation  experiment. 

Many  research  efforts  have  been  undertaken  to  evaluate  methods  of  system  identification  for 
distributed  parameter  and  LSS  type  systems.  For  example,  we  cite  Goodson  and  Polis(1974),  Banks, 
Crowley,  and  Kunisch(1983),  Rafajlowicz(1983),  Schaechter(1982, 1986),  and  Denman  et  al.(1986), 
to  name  a  few.  The  difference  between  these  reported  results  and  what  we  are  investigating  is 
the  connection  we  make  between  the  criteria  for  system  identification  and  the  intended  use  of  the 
estimated  model  for  control  design.  To  our  knowledge  there  has  never  been  an  organized  effort 
along  the  lines  we  suggest.  It  is  our  contention  that  by  preceding  in  this  organized  manner,  backed 
up  by  the  strong  theoretical  foundations  in  Ljung(1987)  and  in  Anderson  et  al.(1986),  that  results 
can  be  obtained  which  will  be  of  scientific  value  in  themselves,  and  will  significantly  increase  our 
prospectives  for  practical  system  identification  and  adaptive  control  of  LSS. 

*The  ideal  estimate  minimises  jhe  least  sqnres  criterion  for  a  specified  length  of  the  data  record.  This  is  not  the 
estimate  which  is  recursively  computed  from  on-line  data.  These  two  estimates  are,  at  best,  asymptotically  equivalent, 
with  the  recursive  estimate  having  a  larger  variance,  the  sise  of  which  is  dependent  on  the  specific  algorithm  for  on-line 
estimation. 
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2.4.3  Use  of  the  MSE  for  Analysis 

The  MSE  for  model  error  is  a  function  of  the  true  plant,  the  estimation  criteria,  and  the  length 
of  the  data  record.  Hence,  computation  of  the  MSE  provides  an  analysis  tool  for  determining  the 
effectiveness  of  on-line  transfer  function  estimation.  We  remark  again  for  emphasis  t  .at  the  MSE 
is  not  dependent  on  how  the  estimate  is  computed.  The  parametrization  alone  can  be  assessed  for 
control  design. 

We  are  continuing  to  examine  the  effect  of  a  number  of  model  parametrizations  on  the  MSE 
corresponding  to  both  parametric  and  non-parametric  methods  of  transfer  function  estimation. 
Specific  studies  will  include  least-squares  and  least-peak  methods  of  parameter  estimation,  with 
parametric  transfer  function  representations  such  as  might  arise  from  continuum  modeling,  wave 
modeling,  or  finite  element  modeling  of  an  LSS.  The  non-parametric  transfer  function  estimation 
methods  will  be  based  on  spectral  estimation  techniques.  The  effect  of  these  choices  on  estimation 
critera  and  the  MSE  will  be  examined,  specifically  where  the  intended  use  of  the  model  is  for  on-line 
control  design.  Computing  the  MSE  will  be  a  significant  part  of  this  effort. 

Continuum  Modeling  As  one  example  of  the  kind  of  analysis  that  might  be  accomplished, 
we  will  examine  the  use  of  simple  continuum  models,  such  as  beams,  shells,  plates,  etc.,  with 
parametrized  spatially  varying  parameters.  For  example,  a  true  system  can  be  represented  as  a 
non-uniform  elastic  rod,  whereas  the  model  set  is  a  uniform  rod  with  the  parameters  to  be  estimated 
being  the  constant  tortional  mass  and  stiffness.  In  this  case  we  will  try  to  determine  what  amount 
of  distributed  tortional  mass  and  stiffness  can  be  sufficiently  well  approximmated  by  a  uniform 
mass  and  stiffness  for  the  purposes  of  control  design.  We  will  repeat  the  analysis  with  the  same 
true  system,  but  with  other  parametric  models  from  the  above  list  as  brifly  described  below. 

Wave  Modeling  Wave  models  may  be  quite  useful  for  designing  active  controllers  which  act 
like  mechanical  energy  absorbers,  6ee,  e.g.,  von  Flotow(1984),  Hagedorn(1985).  The  wave  model  is 
parametrized  in  terms  of  the  frequency  dependent  “scattering  coefficients”  which  describe  the  wave 
properties. 

Finite  Element  Modeling  The  mass  and  stiffness  matrices  which  arise  from  a  finite  element 
analysis  are  dependent  on  a  variety  of  physical  parameters,  such  as  areas,  radii,  mass  densities, 
etc.  Small  variations  about  nominal  values  can  be  regarded  as  the  model  parameters,  see,  e.g., 
McIntosh  and  Floyd(1985). 

Distributed  Spatial  Measurements  The  same  methodology  permits  evaluating  the  benefits  of 
distributed  spatial  measurements,  or  what  is  more  likely,  many  closely  spaced  measurements.  The 
availability  of  such  measurements  can  significantly  simplify  the  physical  parametrization  of  transfer 
function  models.  We  remark  that  the  spatial  measurements  might  not  be  just  position  and  velocity 
but  might  involve  the  outputs  of  very  many  strain  gauges  placed  appropriately  on  the  structure. 

2.4.4  Experiment  Design 

The  MSE  can  be  used  to  aid  in  designing  the  estimation  experiment.  The  objective  being  that 
during  the  actual  experiment  the  maximum  amount  of  data  is  extracted  in  computing  the  transfer 
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function  estimate.  This  particular  issue,  referred  to  as  experiment  design ,  has  been  extensively 
investigated,  e.g.,  Goodwin  and  Payne(1977).  Recently,  there  has  been  a  specific  interest  in  using 
these  results  in  order  to  design  experiments  which  optimize  the  use  of  the  transfer  function  estimate 
in  control  design,  e.g.,  Ljung(1985),  Wahlberg  and  Ljung(1986),  Gevers  and  Ljung(1986).  These 
results  minimize  a  weighted  norm  of  the  MSE  with  respect  to  free  parameters  or  choices  in  the 
experiment,  e.g.,  input  spectrum,  model  order,  etc.  Since  the  MSE  depends  on  the  true  plant,  the 
methods  offer  usefull  guidelines  for  designing  the  experiment.  The  basic  idea  is  to  make  the  criterion 
for  estimation  similar  to  the  criterion  for  control  design  as  listed  under  Task  1  in  Section  2.3. 

At  the  present  time  the  use  of  the  MSE  as  a  measure  of  experiment  design  for  the  case  when  the 
intended  use  of  the  transfer  function  estimate  is  control  synthesis  is  still  in  the  beginning  stages. 
One  of  the  major  problems  is  that  measures  of  closed  loop  performance  are  complicated  functions 
of  the  MSE  and  the  criterion  for  estimation. 

We  have  been  studying  this  relationship  with  particular  emphasis  on  the  LSS  dynamic  repre¬ 
sentations  previously  mentioned  and  the  control  design  appraoches  discussed  next. 


2.4.5  Control  Design  Criteria  and  Approaches 

In  the  previous  discussion  we  have  emphasized  the  need  to  evaluate  the  model  structure  for  identi¬ 
fication  in  terms  of  the  intended  use  of  the  estimated  model  for  control  design.  In  this  section  we 
briefly  discuss  control  design  criteria  and  approaches.  The  control  design  criteria  typically  include: 

•  vibration  suppression 

•  regulation 

•  tracking 

Typical  constraints  include: 

•  robustness  to  model  error 

•  actuator  limitations  •  ' 

•  decentralized  architectures 

Control  design  methods  for  achieving  these  goals  are  hard  to  classify  or  codify,  but  a  partial  list 
is  as  follows: 

•  pole  placement 

•  minimum  variance 

•  LQG 

•  stable  factorization 

•  ^oo-optimization 
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First  of  all,  it  is  not  possible  here  to  discuss  all  these  methods.  Secondly,  what  is  of  pri¬ 
mary  interest  is  the  goal  of  the  methodology  and  how  it  is  reflected  in  the  criterion  for  system 
identification.  Now,  although  it  is  not  easily  apparent,  the  above  design  approaches  can  all  be 
subsumed  using  the  so-called  stable  factorization  (SF)  approach  as  presented  in  Desoer  et  al.(1980) 
and  Vidyasagar(1985). 


Stable  Factorization  Approach  The  basic  idea  is  to  “factor”  the  plant  into  a  ratio  of  two 
stable  transfer  functions,  or  operators,  in  the  more  general  case.  It  can  be  shown  that  these 
factors  allow  for  the  parametrization  af  all  stabilizing  controllers  in  terms  of  a  single  stable  transfer 
function,  referred  to  as  the  Youla  parameter  [Youla  et  al.,  1976].  Moreover,  and  this  is  perhaps 
the  most  significant  aspect  of  this  controller  representation,  all  the  closed-loop  transfer  functions 
from  exogenous  inputs  to  internal  outputs  depend  linearly  on  the  Youla  parameter.  The  use  of  this 
approach  specifically  for  vibration  control  of  lightly  damped  systems  and  LSS  can  be  found  in 
Bennett  et  al.(1987),  Helmicki  et  al.(1987),  and  Pichet(1985). 

To  relate  the  SF  approach  to  pole  placement  simply  means  to  select  the  6table  factors  so  that 
the  poles  are  placed  where  desired  in  the  factors. 

The  LQG  approach  requires  the  introduction  of  a  norm  in  the  signal  space,  i.e.,  average  integral 
squared  in  time.  This  translates  to  what  is  called  the  Hi  norm  of  the  closed-loop  transfer  function 
matrix,  i.e.,  integral  squared  in  the  frequency  domain.  In  fact,  the  LQG  state  space  controller- 
observer  solution  is  equvalent  to  a  particular  stable  factorization,  see,  e.g.,  Nett  et  al.(1984). 


Robustness  The  SF  approach  also  provides  for  a  natural  measure  of  robustness  to  model  error. 
It  turns  out  that  the  weakest  topology  for  handling  robustness  is  exactly  a  ratio  of  stable  factors, 
Vidyasagar(1984).  Thus,  robustness  to  model  error  is  more  naturally  seen  as  perturbations  in 
the  stable  factors.  The  appropriate  norm  is  the  peak  value,  as  a  function  of  frequency,  of  the 
closed-loop  transfer  function,  referred  to  as  the  /foo-norm.  Techniques  for  #oo -optimization  are 
attempting  to  find  that  control  which  achieves  optimal  robustness  to  model  error,  see,  e.g.,  Francis 
and  Zames(1983),  Boyd  et  al.(1987). 

Preliminary  analysis  of  robustness  to  model  error  in  the  LSS  environment  has  been  examined 
in  Kosut  et  al.(1983),  Bhaya  and  Desoer(1985),  to  name  a  few.  We  have  examined  the  use  of  SF 
approaches  and  how  to  affect  the  identification  criterion  so  as  to  obtain  an  estimate  which  results 
in  a  control  design  that  is  maximally  robust. 


2.5  Computing  the  Model  Estimate  from  Data 

In  general,  adaptive  algorithms  share  a  similar  genesis  and  purpose,  namely  to  adjust  a  parameter 
vector  to  asymptotically  approach  a  member  of  the  set  which  minimizes  an  average  squared  error, 
such  as  prediction  error  for  system  identification  or  tracking  error  for  direct  adaptive  control. 

In  the  ideal  case,  the  parametrization  can  be  selected  so  as  to  achieve  what  is  called  perfect 
matching ,  i.e.,  for  some  restricted  class  of  exogenous  inputs  (usually  no  disturbances),  and  a  suffi¬ 
cient  number  of  parameters  (no  unmodeled  dynamics),  there  is  a  single  minimum  which  produces 
either  zero  error  or  the  error  is  zero-mean  white  noise. 

Under  realistic,  but  non-ideal,  conditions,  the  best  that  can  happen  is  that  the  adaptive  pa- 
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rameters  asymptotically  approaches  a  small  neighborhood  of  a  local  minima  of  the  average  squared 
error,  see,  e.g.,  Anderson  et  al.(1986). 


2.5.1  Adaptive  Algorithms 

Parameter  Adaptation  One  method  to  achieve  this  result  is  to  adjust  the  parameters  in  the 
negative  gradient  direction  of  the  instantaneous  squared  error,  a  procedure  that  was  once  referred  to 
as  the  “MIT- Rule”  [see  Section  2.7.1].  More  general  procedures  involve  adjustments  in  accordance 
with  Gauss-Newton  iterations,  thereby  taking  into  account  second  order  effects.  The  recursive  least 
square  algorithm  is  the  classic  example  of  this  approach.  These  methods  can  be  extended  for  a  host 
of  error  measures,  which  are  not  required  to  be  quadratic,  see,  e.g.,  Ljung  and  Sodestrom(1983), 
Goodwin  and  Sin(1984). 


Constructing  the  Error  Gradient  from  Measured  Signals  A  difficulty  with  constructing 
the  above  algorithms  is  that  the  error  gradient  with  respect  to  the  parameters,  as  if  they  were 
fixed,  is  required.  However,  in  most  cases,  the  error  gradient  is  a  function  of  the  true,  but  unknown 
plant.  The  actual  algorithm  must  then  U6e  an  approximation,  which  is  loosely  referred  to  as  the 
regressor.  (An  example  of  constructing  the  regTessor  for  a  simple  continuum  model  is  provided  in 
the  Appendix  in  Section  3.7.3.)  f 


Algorithm  Modifications:  “AI”  Features  To  insure  that  adaptive  algorithms  are  working, 
one  often  includes  various  “safety  nets”  or  AI  features.  For  example,  in  uncertainty  estimation, 
the  controller  is  not  changed  until  some  level  of  confidence  is  established.  Other  AI  features 
include  monitoring  of  information,  covariance  monitoring,  perisitent  excitation  conditioning,  and 
parameter  projection,  to  name  a  few.  Various  normalizations  of  the  measured  sugnals  also  enhance 
the  ability  of  adaptive  systems  to  insure  bounded  parameter  values.  These  latter  approaches  need 
prior  information  regarding  model  error  and  parameter  ranges. 


On-Line  Control  Design  Rules  Once  the  estimated  model  is  obtained,  the  control  design  rule 
(see  Figure  2.3)  transforms  the  estimate  to  a  controller  which  is  then  implemented  with  the  actual 
system.  If  the  model  estimate  includes  an  uncertainty  estimate  as  well,  then  the  control  design  rule 
will  involve  a  robust  design  procedure  such  as  outlined  in  our  discussion  of  the  stable  factorization 
approach. 

We  will  continue  to  develop  and  evaluate  the  control  design  rule  based  on  the  model  structure 
selection  and  the  control  design  criteria  listed  before.  Once  the  design  rule  is  established,  we  need 
to  then  turn  our  attention  to: 

•  experiment  design 

•  stability  and  connvergence  analysis 

These  issues  will  be  discussed  in  the  following  sections.  The  experiment  design  problem  involves 
computing  the  MSE.  The  stability  and  convergence  analysis  involves  the  method  of  averaging  and 
other  extentions  and  new  techniques  as  required. 
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2.5.2  Performance  of  the  Algorithm 

Despite  the  above  approximation,  from  a  practical  point  of  view  it  is  certainly  acceptable  that 
the  parameters  approach  and  remain  in  a  small  neighborhood  of  a  local  minimum  of  the  average 
squared  error,  provided  that  members  of  this  set  also  produce  acceptable  performance. 

We  refer  to  each  member  of  this  set  as  a  tuned  parameter ,  and  to  the  corresponding  feedback 
system  as  the  tuned  system  [see,  e.g.,  Kosut  and  Friedlander(1985)].  Clearly  the  tuned  system  is 
the  adaptive  feedback  system  of  Figure  2.3  with  the  adaptive  parameters  held  fixed  at  a  tuned 
setting.  We  can  now  pose  the  following  questions  regarding  the  adaptive  system  : 

1.  How  do  the  tuned  parameters  depend  on  the  exogenous  inputs? 

2.  Is  the  adaptive  system  stable  in  a  neighborhood  of  the  tuned  system.  Furthermore,  how  small 

is  this  neighborhood? 

.  .  1  - 

3.  What  is  the  region  of  attraction  to  a  small  neighborhood  of  the  tuned  system. 

4.  What  is  the  rate  of  convergence  to  this  small  neighborhood. 

These  questions  can  only  be  answered  by  application  of  methods  of  stability  and  convergence 
analysis.  (See  Sections  2.7  and  3.8.) 

An  important  part  of  our  research  effort  will  be  in  developing  suitable  approximations  to  the 
error  gradient.  The  suitability  of  a  particular  approximation  will  of  course  be  dependent  upon  the 
results  of  the  stability  and  convergence  analysis. 

2.5.3  Multiple  Model  Adaptive  Control 

We  are  also  examining  an  adaptive  control  scheme  which  involves  the  selection  of  a  fixed  controller 
out  of  a  finite  collection,  where  each  controller  is  robustly  designed  to  account  for  the  true  plant 
being  in  a  not  necessarily  small  set  of  uncertainty.  There  are  no  adaptive  “parameters”  in  the 
conventional  sense  where  adjustments  are  made  to  either  model  or  controller  parameters.  The 
parameters  here  involve  pseudo-probabilities  or  weights  assigned  to  each  of  the  finite  controllers. 
These  quantities  are  computed  recursively  from  the  measured  data.  Various  adaptive  selection 
mechanisms  which  depend  on  these  pseudo-probabilities  will  be  developed  as  appropriate. 

Figure  2.5  depicts  the  general  adaptive  set-up  that  we  consider.  The  design  of  each  controller 
Ci{z)  is  not  an  adaptive  control  design,  but  rather,  a  robust  control  design  task.  The  signals  used 
to  adaptively  select  the  controller  are  the  innovations  sequences,  or  prediction  errors,  denoted  by 
£\{t),-’‘i£N(t),  where  each  is  obtained  from  the  Kalman  filters  denoted  KF\,...,KFn • 

Our  main  task  is  to  explain  how  the  best  controller  in  the  set  of  controllers  can  be  selected  to 
control  the  plant.  This  type  of  adaptive  controller  is  not  like  the  conventional  ones  where  either 
model  or  controller  parameters  are  directly  adjusted.  Here  the  parameters  are  contained  in  the 
mechanism  for  switching  amongst  the  preselected  robust  controllers,  which  is  essentially  a  gain 
scheduling  procedure,  but  is  adaptive  in  the  sense  that  the  schedule  is  being  learned  from  the 
measured  data.  The  gain  schedule  is  usually  set  in  advance.  For  example,  in  a  flight  control  system 
the  gain  schedule  is  a  predetermined  function  of  the  Mach  number  and  aerodynamic  pressure. 
One  of  the  interesting  possible  advantages  of  this  method  is  that  although  the  plant  may  have  a 
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Figure  2.5:  Multiple  Model  Adaptive  Control 


large  number  of  uncertain  parameters,  it  is  possible  that  only  a  few  controllers  are  required,  and 
hence  only  a  few  parameters  in  the  selection  mechanism.  Also  the  individual  robust  controllers  can 
be  based  on  uncertainty  in  physical  parameters  rather  than  canonical  parameters,  6uch  as  transfer 
function  coefficients,  as  used  in  the  conventional  parameter  estimation  techniques,  e.g.,  least  squares 
with  a  linear  regression  model. 

The  adaptive  selection  mechanism  as  described  above  has  been  investigated  in  Anderson  and 
Moore(1979)  and  Athans  et  al.(1977).  In  this  latter  reference  this  scheme  is  referred  to  as  a  Multiple 
Model  Adaptive  Control  (MMAC),  but  the  individual  controllers  are  not  selected  to  be  robust  in 
the  manner  described  above.  We  are  developing  an  analysis  of  the  convergence  properties  of  the 
adaptive  selection  algorithm  following  the  analysis  in  Anderson  and  Moore(1979). 


2.6  Computing  Model  Uncertainty  from  Data 

The  requisite  information  for  robust  feedback  stabilization  is  typically  a  nominal  model  of  the 
plant  together  with  an  uncertainty  profile.  From  the  above  discussion,  computation  of  the  MSE 
for  model  error  allows  one  to  evaluate  a  control  design  which  is  based  on  a  nominal  (estimated) 
model  together  with  an  a  priori  estimate  of  model  accuracy.  We  will  use  both  parametric  and 
non-parametric  methods  of  transfer  function  estimation,  where  the  non-parametric  methods  yield 
direct  estimates  of  the  accuracy.  Essentially,  'his  approach  provides  a  direct  estimate  of  the  MSE. 
The  accuracy  to  which  the  estimate  is  set  depends  on  the  criteria  for  control  design. 

The  reason  that  we  consider  non-parametric  methods  is  that  they  can  provide  estimates  of 
model  accuracy  as  a  function  of  frequency,  particularly  over  those  frequency  ranges  where  either 
the  model  structure  is  poorly  known,  or  else  the  model  accuracy  in  that  range  is  unimportant  for 
control  design.  The  feasibility  of  using  spectral  techniques  for  the  estimation  of  model  error  can  be 
found  in  Kosut  ( 1986, 198 7a)i  and  for  LSS  application  in  Kosut(1987b). 
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2.7  Methods  of  Stability  and  Convergence  Analysis 

.  ■  >  ' 

In  order  to  more  fully  appreciate  the  significance  of  the  convergence  and  stability  problem  in 
adaptive  control  we  offer  the  following  historical  digression.  This  will  also  serve  to  motivate  and 
place  our  previous  results  and  proposed  new  research  directions  in  this  area. 


2.7.1  A  Bit  of  History 

Over  the  years,  the  application  of  adaptive  control  theory,  as  reported  in  the  literature,  has  not 
met  with  a  uniform  success.  Surprisingly,  some  early  applications  to  flight  control  were  reported 
to  be  quite  successful  despite  the  lack  of  theory,  e.g.,  Whitaker(1959)  and  Osborne  et  al.(1961).  In 
more  recent  years,  with  more  theory  available,  some  reported  applications-again  to  flight  control- 
were  not  deemed  successful,  e.g.,  Bryson(1977).  At  the  same  time  as  these  mixed  results  were  being 
obtained  in  the  area  of  flight  control  applications,  several  researchers  were  reporting  good  results 
with  applications  in  process  control  and  in  ship  steering,  e.g.,  Astrom  et  al.(1965, 1973).  Why  these 
mixed  results? 

The  answer  lies  partly  in  the  engineering  aspects  which  motivate  the  use  of  adaptive  control,  and 
partly  in  the  overall  research  goals  in  developing  an  adaptive  control  theory.  These  are  sometimes 
in  conflict.  The  engineering  application  calls  for  a  control  structure  which  takes  into  account  the 
nature  of  the  physical  problem.  The  theory  is  looking  to  solve  a  canonical  problem,  and  hence,  often 
fails  to  account  for  a  priori,  or  experiential  “rule-based”  knowledge.  A  good  example  is  proxided 
by  a  brief  history  of  the  adaptive  control  technique  developed  by  Whitaker(1959),  later  referred  to 
as  the  “M.I.T.-Rule”.  The  basic  idea  is  to  adjust  the  adaptive  control  parameters  in  accordance 
with  an  instantaneous  gradient  descent  of  the  squared  error  signal  as  described  in  Section  3.4. 

Although  the  resulting  algorithm  is  simple  to  construct,  and  incidently  worked  well  enough 
in  simulations,  there  was  no  guarantee  the  resulting  adaptive  system  would  be  stable.  Probably 
because  the  causes  of  these  difficulties  were  not  readily  forthcoming,  and  perhaps  also  there  was 
(and  still  is)  the  desire  to  construct  a  “universely”  stabilizing  adaptive  control,  other  approaches 
to  adaptive  control  were  explored. 

One  such  avenue  was  announced  in  a  paper  by  Parks(1966),  where  the  M.I.T.-Rule  was  “re¬ 
designed”  in  such  a  way  that  the  resulting  adaptively  controlled  system  had  a  guaranteed  stability 
property.  The  idea  was  to  make  a  certain  transfer  function  strictly  positive  real  (SPR)  which  has 
the  effect  of  insuring  that  the  parameter  adjustment  is  always  in  the  right  direction.  This  “SPR- 
Rule”  idea  was  developed  later  for  more  general  situations,  e.g.,  Monopoli(1974),  Egardt(1979), 
and  Narendra,  Lin,  and  Valavani(1980).  Further  extentions  and  expositions  along  the  SPR  line  can 
be  found  in  Landau(1979),  Goodwin  and  Sin(1983),  and  Kosut  and  Friedlander(1985). 

Unfortunately,  these  SPR- Rule  algorithms  have  exactly  the  same  stability  problems  as  the  MIT- 
Rule  algorithms,  but  only  when  unmodeled  dynamics  and  disturbances  are  taken  into  account.  This 
issue  was  perhaps  not  fully,  realized  until  the  appearance  of  the  work  of  Rohre  et  al.(1982),  which 
vividly  demonstrated  the  potentially  local  unstable  nature  of  adaptive  systems  by  posing  a  “counter¬ 
example”  representative  of  a  non-ideal,  but  practical,  circumstance.  The  adaptive  parameters 
exhibited  the  characteristic  rapid  transient  followed  by  a  steady  parameter  drift.  In  this  case, 
however,  the  parameters  did  not  settle  down  in  the  constant  parameter  stability  set.  Thus,  once  the 
parameters  drifted  outside  the  stability  set,  the  states  of  the  controlled  system  became  exceedingly 
large. 
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Cause  of  Parameter  Drift  Instability  In  scanning  the  literature,  the  earliest  reference  which 
contains  a  rigorous  analysis  of  this  instability  phenomena  appears  to  be  in  a  note  by  James(1971). 
A  Floquet  analysis  is  applied  to  an  MIT-Rule  one-parameter  linear  adaptive  system  (adaptive 
feedforward  only)  with  periodic  inputs.  The  result  is  a  complex  stability-instability  boundary 
revealing  multiple  resonance  phenomena,  precisely  like  that  associated  with  the  Mathieu  equation, 
e.g.,  Hale(1969).  No  nonlinear  analysis  was  undertaken  for  the  case  with  adaptive  feedback. 

The  work  of  Ljung(1977)  which  utilizes  a  technique  of  stochastic  averaging  can  perhaps  be 
said  to  herald  the  beginning  of  a  nonlinear  analysis  technique,  but  its  applicability  to  adaptive 
systems  in  non-ideal  situations  was  not  fully  explored  at  that  time.  Lyapunov  techniques,  explored 
by  Anderson  and  Johnstone(19S3)  and  Ioannou  and  Kokotovic(1983),  showed  that  persistently 
exciting  signals  are  required  to  provide  a  uniform  asymptotic  stability  of  the  ideal  adaptive  system, 
and  this  in  turn  provides  robustness  to  various  unmodeled  dynamics  and  disturbances.  Input- 
output  formulations  providing  a  local  stability  were  developed  in  Kosut  and  Johnson(1984)  and 
Kosut  and  Anderson(1986);  these  also  needed  persistent  excitation.  In  none  of  these  examinations 
of  the  stability  properties  of  adaptive  control  systems  was  the  precise  mechanism  for  the  parameter 
drift  instability  identified,  particularly  for  the  counter-example  posed  by  Rohrs  **t  al.(1982). 


Method  of  Averaging  The  first  insights  came  in  a  series  of  papers  by  Astrom(1983,1984), 
which  provided  an  analysis  based  on  the  classical  method  of  averaging  of  Bogoliubov  and  Mitropol- 
ski(1961).  The  method  requires  slow  adaptation  and/or  small  signal  magnitudes,  and  under  these 
conditions  Astrom  was  able  to  show  heuristically  the  instability  mechanism.  This  was  made  piecise 
in  a  paper  by  Riedle  and  Kokotovic(1984, 1985),  where  they  established  am  average  SPR  condition 
which  provides  a  sharp  stability-instability  boundary,  again  in  the  case  of  slow  adaptation  with 
periodic  inputs,  but  only  for  the  linearized  adaptive  system.  The  average  SPR  condition  is  sig- 
nificantly  less  restrictive  than  the  usual  SPR  condition  because  the  latter  requires  that  a  certain 
closed-loop  transfer  function  be  positive  at  all  frequencies,  whereas  the  former  depends  also  on  a 
signal  spectrum  and  requires  that  the  energy  at  those  frequencies  where  the  transfer  is  positive 
should  dominate  those  frequencies  where  it  is  negative.  This  result  put  teeth  into  an  obvious  en¬ 
gineering  folk  theorem.  Extentions  of  this  important  result  to  other  than  periodic  signals  and  the 
relation  to  persistent  excitation  is  provided  in  Kosut,  Anderson,  and  Mareels(1986).  Extending  the 
method  of  averaging  to  the  full  adaptive  feedback  case  has  also  been  developed,  e.g.,  Riedle  and 
Kokotovic(1986),  Kosut(1986),  and  Bodsen  et  al.(1986). 

In  the  light  of  the  new  averaging  results  it  is  now  possible  to  more  fully  understand  the  difficulties 
with  the  MIT-Rule.  It  is  clear  what  causes  the  instabilities,  and  most  importantly,  how  to  avoid 
them.  The  MIT-Rule  is  “re-visited”  with  these  new  tools  in  hand  in  Mareels  et  al.(1986)  and 
Mareels(1986),  and  results  are  obtained  which  essentially  extend  the  results  of  James(1971). 

2.7.2  New  Research  Directions 

The  conclusion  from  this  bit  of  history,  particularly  for  adaptive  control  of  LSS  is  this:  the  pa¬ 
rameters  for  adaptation  can  correspond  to  physically  meaningful  parameters,  rather  than  canonica1 
parameters  such  as  transfer  function  coefficients.  Once  having  selected  the  critical  parameters,  the 
parameter  adaptive  algorithm  can  be  developed  from  the  intuitively  appealing  MIT-Rule,  or  any 
similar  procedure.  Having  established  the  adjustment  procedure,  stability  of  the  complete  adap¬ 
tive  control  system  can  be  analyzed  using  the  method  of  averaging.  This  will  provide  conditions 
for  instability  which  need  to  be  either  avoided,  or  if  not,  then  provide  guidelines  for  control  law 
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modifications. 

Taking  this  approach  is  not  as  straight  forward  as  it  might  sound.  There  are  some  pitfalls  to 
avoid  and  definitely  areas  where  basic  research  is  required.  These  include  the  following: 

•  The  theoretical  justification  for  the  method  of  averaging  is  that  a  certain  design  parameter 
(not  one  of  the  adaptive  parameters)  is  sufficiently  small.  In  adaptive  systems,  this  design 
parameter  depends  on  the  speed  of  adaptation  and  the  magnitude  of  exogenous  inputs  such 
as  reference  commands  and  disturbances.  Current  methods  to  predict  the  maximum  size  of 
this  parameter  have  been  shown  by  simulation  to  be  very  conservative.  That  is,  the  stability 
properties  of  the  adaptive  system  as  obtained  from  an  averaging  analysis,  are  consistent  with 
simulations  even  for  parameter  values  which  greatly  exceed  the  theoretical  limit.  This  is  an 
area  which  needs  to  be  examined,  since  we  do  not  want  to  be  overly  conservative. 

•  Averaging  analysis  works  for  slow  parameter  variation,  because  the  parameters  are  then 
not  being  radically  affected  by  any  transients.  The  restrictiveness  of  slow  adaptation  can  be 
removed,  but  this  will  require  a  better  understanding  of  the  transient  behavior  of  the  adaptive 
system.  Some  preliminary  results  utilizing  fixed-point  theory  and  Floquet  theory  have  been 
examined  in  Kosut  and  Bitmead(1986, 1987).  This  is  a  most  difficult  problem,  but  the  payoff 
for  LSS  systems  would  be  significant. 

•  Although  the  method  of  averaging  in  principal,  involves  straightforward  calculations,  even 
simple  examples  can  just  barely  be  worked  out  by  hand.  It  is  clear  that  the  level  of  complexity 
of  a  realistic  adaptive  or  otherwise  nonlinear  system  is  well  beyond  hand  calculation.  Hence, 
for  such  powerful  analytic  methods  to  be  of  practical  benefit,  it  is  imperative  to  develop  “user- 
friendly”  software  tools  which  provide  the  requisite  nonlinear  analysis.  At  the  present  time, 
aside  from  simulation  capability,  there  are  no  available  software  tools  for  dealing  principally 
with  adaptive  systems,  and  certainly  none  for  more  general  nonlinear  systems.  This  is  a 
research  issue  in  both  mathematics  and  computation,  and  it  is  one  that  may  prove  essential 
to  some  of  the  tasks  we  propose  to  undertake,  although  it  is  not  a  major  thrust  in  our  research 
program.  Nonetheless,  it  is  sometimes  an  overlooked  research  issue,  and  thus  warrants  more 
than  a  passing  remark. 

•  The  method  of  averaging  so  far  has  been  extended  and  applied  only  for  adaptive  linear  control 
systems,  that  is,  if  the  adaptive  parameters  were  held  fixed,  then  the  resulting  closed-loop 
system  would  be  linear.  This  is  certainly  not  the  case  during  rapid  slewing  maneuvers  which 
involve  nonlinear  kinematics  as  well  as  nonlinear  control.  Moreover,  the  control  system  itself 
could  be  nonlinear,  despite  the  fact  that  certain  parameters  can  be  adapted.  For  example,  a 
time-optimal  control  of  a  double  integrator  (e.g.,  a  single  point  mass  system)  is  a  nonlinear 
function  of  position  and  velocity.  Although  the  method  of  averaging  can  handle  nonlinear 
systems  in  theory,  see,  e.g.,  Hale(1969),  the  application  to  adaptive  nonlinear  control  remains 
an  open  area  for  basic  research. 
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Chapter  3 

Identification  for  Control  Design 


3.1  Introduction 


*  '  *■ 

In  order  to  evaluate  a  particular  model  parametrization  in  system  identification,  when  the  intended 
use  of  the  model  is  for  control  design,  the  analyst  needs  to  specify: 


•  a  representative  “true”  system 

•  a  model  parametrization 

•  an  estimation  algorithm 

•  a  set  of  closed-loop  performance  criteria 

•  a  contol  design  approach  or  rule 

3.2  “True”  System  vs.  Model  Set 

The  relation  among  these  choices  can  be  seen  by  considering  the  following  example.  Suppose  the 
uer  system  is  an  undamped  elastic  rod  of  length  t  with  a  torque  u(t)  applied  at  one  end.  Assuming 
o  external  disturbances,  the  motion  of  the  rod  for  small  angular  deflections  is  well  approximated 
by  tb“  partial  differential  equation 

m(*)z«(t,i)  +  \p{x)zx{t,x)]x  =  0  (3.1) 

with  boundary  conditions 

z*«,0)  =  -ti(f)/p(0),  zx(t,l)  =  0  (3.2) 

where  m(x)  represents  the  tortional  mass  density,  p(x)  represents  the  tortional  stiffness,  and  z(t,x) 
is  the  angular  position  of  the  rod  at  time  t  and  position  x  €  [0,/].  Let  y(t)  denote  the  output  of  a 
sensor  which  perfectly  accurately  measures  the  angular  velocity  of  the  rod  at  the  actuator  location 
x  =  0.  Thus, 

y(t)  =  *«(t,0)  (3.3) 
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It  is  clear  that  the  relation  between  the  actuator  input  u(t)  and  the  velocity  sensor  output  y{t)  is 
given  by 

V  (t)  =  Po(s)u(t)  (3.4) 

where  P0(s)  is  the  transfer  function1  from  tt (t)  to  y(t). 

Suppose  we  now  choose  as  a  parametric  model  of  (3.1)  a  uniform  undamped  elastic  rod  of 
length  l,  with  the  same  torque  u(t)  applied  at  one  end.  Assuming  no  external  disturbances,  the 
motion  of  the  rod  for  small  angular  deflections  is  well  approximated  by  the  wave  equation  [see,  e.g., 
Pichet(1985)] 

nr22U(t,x)  -  GJz„(t,x)  =  0  (3.5) 

i  ■  .  I  r 

with  boundary  conditions 

*.M)  =  -£jti(t),  **(*.0  =  0  (3.6) 

where  /i  is  mass  density,  r3  is  radius  of  gyration,  and  GJ  is  the  tortional  stiffness.  Define  the 
physical  parameter  vector 

e  =  (a,  0)T  =  {Jpr'GJ,  yJ^)T  (3.7) 

Then,  the  parametric  model  transfer  function  from  u(t)  to  y(t)  is 

1  11  +  e~2p* 

P{s,8)  =  -coth  (/3s)  =  (3.8) 

which  can  be  viewed  as  a  physical  parametrization. 

Unless  m(x)  and  p{x)  in  (3.1)  are  constant,  it  is  unlikely  that  there  exist  any  values  of  a  and 
P  in  (3.8)  such  that  P{s,9)  =  P0(s).  However,  there  certainly  could  be  model  parameter  values 
which,  for  some  class  of  functions  m(x)  and  p(x),  the  model  error  P(s,8)  -  P0(s)  is  sufficiently 
small,  in  some  sense,  to  satisfy  closed  loop  performance  demands.  To  be  more  precise  about  how 
to  measure  model  error,  we  briefly  state  some  known  results  about  robustness  of  feedback  systems 
to  model  error. 


3.3  Conditions  for  Closed-Loop  Stability  Robustness 


Suppose  we  have  selected  a  control  design  rule  which  for  every  value  of  the  physical  parameter 
vector  9  produces  a  feedback  controller  with  transfer  function  C(s,  9),  such  that  the  feedback  system 
consisting  of  P(s,9)  and  C(s,9)  is  stable  and  also  satisfies  the  performance  demands.  Then,  this 
same  controller  will  also  stabilize  the  true  system  P0(»)  if 


|  P(ju,9)-P0(ju)\- 


C(ja>}8) 

1  +  P(ju,8)C(ju,8) 


<  1  ,  Vu>  6  R 


(3.9) 


This  is  a  typirj  result  arising  from  robust  control  theory,  see,  e.g.,  IEEE  Transactions  on  Au¬ 
tomatic  Control:  Special  Issue  on  Robust  Control  (1981).  More  recent  results  are  more  extensive 
involving  similar  ideas  but  the  formulations  would  take  us  too  far  afield  in  this  discussion,  see,  e.g., 

1Tbe  term  is  sted  loosely.  More  precisely,  y(t)  «=  (P0u)(t)  where  P0  is  s  convolution  operator  with  kernal  p0(<) 
whose  Laplace  transform,  or  '‘transfer  function”,  is  Po{*)- 


r 
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Vidyasagar(1985).  We  remark  that  a  similar  result  holds  for  discrete- time  feedback  systems.  In 
this  case,  the  above  condition  becomes 


«*■*> <‘-v^ '-'-'I 


(3.10) 


where  we  use  “z-transforms”  rather  than  Laplace  transforms  for  the  discrete-time  linear  operators 
P,  P0,  and  C.  We  will  return  to  the  sampled-data  representation  in  the  sequel. 

If  for  feedback  design  the  above  inequality  is  essentially  what  is  needed  to  insure  stability 
robustness,  then  there  is  nothing  particularly  precious  about  the  physical  parametrization  of  (3.8). 
One  could  just  as  well  choose  the  canonical  transfer  function  representation 


PM  = 


bis"-1  +  ---  +  6n 
*"  + -H»iSn_1  -| - (-  an 


(3.11) 


where  now  6  is  the  physical  parameter  vector 


9  (flj  9  •  •  •  y  i  •••  >  ^n) 


(3.12) 


Observe  that  the  physical  parameters  (a,/?)  uniquely  determine  the  canonical  parameters 
(a1,...,an,61,...,6n),  but  the  relationship  is  obviously  complicated.  The  primary  disadvantage 
of  using  the  canonical  parameters  is  that  there  could  be  many  more  than  the  physical  parameters, 
especially  in  the  case  where  good  model  accuracy  is  required  over  a  wide  band  of  frequencies.  As 
we  have  mentioned,  however,  it  is  in  general  much  easier  to  construct  numerically  stable  parameter 
estimation  algorithms  for  canonical  parameters  than  for  physical  parameters. 

The  above  simple  example  illustrates  the  difficulties  involved  in  choosing  a  model  parametriza¬ 
tion.  Obviously  the  extention  to  the  LSS  environment  is  not  so  straightforward  because  of  the 
additional  complexity  generated  by  dither  the  large  physical  size  of  the  structure  or  the  large  num¬ 
ber  of  modes  within  the  controller  bandwidth.  Before  discussing  these  issues  we  first  review  some 
of  the  known  results  in  the  estimation  of  transfer  functions  from  measured  data. 


3.4  Review  of  Transfer  Function  Estimation 

The  estimation  of  a  system's  transfer  function  from  input-output  data  has,  of  course,  a  long  his¬ 
tory,  and  we  will  not  attempt  to  document  that  here.  There  are  many  excellent  survey  articles  and 
textbooks  that  can  be  referenced,  e.g.,  Jenkins  and  Watts(1968),  Astrom  and  Eykhoff(1970),  Auto- 
matica:  Special  Issue  on  Identification^  zn.  1981),  Ljung  and  Soderstrom(1983),  and  Ljung(1987), 
to  name  a  few.  These  references  clearly  explain  the  theory  and  practice  of  both  parametric  and 
non-parametric  methods  of  transfer  function  estimation.  Parametric  methods  usually  involve  the 
minimization  of  some  time^ompin  function  of  the  parameters  using  an  iterative  or  recursive  al¬ 
gorithm.  Non-parametric  methods  involve  the  computation  of  correlation  functions  and/or  their 
respective  spectral  densities.  In  either  case,  it  is  possible  to  obtain  theoretical  results  which  provide 
asymptotic  expressions,  as  the  data  record  length  increases,  for  the  mean-square-error  (MSE)  of 
the  transfer  function  estimate,  which  is  precisely  the  quantity  that  refects  model  accuracy. 
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3.4.1  Sampled-data  Representation  and  Data  Structure 

Suppose  that  the  true  plant  system  to  be  estimated  can  be  described  by  the  discrete-time  relation2 

y(t)  =  Po(qHt)  +  d(t)  (3.13) 

where  y(t)  and  u(t)  are  the  measured  input  and  output,  P0(q)  is  the  transfer  function,  and  d(t) 
is  the  disturbance.  It  is  further  assumed  that  d(t)  is  the  output  of  a  linear  system  with  transfer 
function  W0(q)  which  is  driven  by  white  noise  u0(t)  of  intensity  A0.  Thus, 

d(t)  =  W0(q)u0(t)  (3.14) 

and  hence,  has  the  spectral  density 

Su(u)  =  \W0{j“)\2\0  (3.15) 

The  estimation  problem  is  to  select  out  of  a  model  set,  an  estimate  of  P0(q)  and  WQ(q)  from  the 
observed  finite  data  record 

z*  =  {y(t),u(t):t=l . N }  (3.16) 

Observe  that  if  P0(s )  is  the  true  continuous  time  transfer  function,  see,  e.g.,  (3.1),  then  the  sampled- 
data  representation  is 

P0(9)  =  (l-r,)2{;PoW}  (3.17) 

where  Z{-}  denotes  the  usual  z-transform  operator. 

3.4.2  Prediction  Error  Methods  of  Parameter  Estimation 

Parametric  methods  of  identification  proceed  by  first  selecting  a  set  of  candidate  parametric  models 
of  the  form: 

y(t)  =  P(q,9)u(t)  +  W(q,0)v(t)  (3.18) 

where  v(t)  is  a  zero-mean  white  noise  sequence  of  intensity  A,  and  0  e  Rp  is  a  vector  of  model 
parameters  which  can  be  either  physical  or  canonical  as  previously  discussed.  Observe  that  if 
P(s,0)  is  the  continuous  time  transfer  function,  e.g.,  (3.8),  then  the  sampled-data  representation  is 

P(q,9)  =  (l-q-1)Z{-P(s,0)}  (3.19) 

<5 

•  .• 

Practically  every  parameter  estimation  scheme  is  based  on  developing  an  on-line  or  off-line 
procedure  for  selecting  the  model  parameters  0  so  as  to  minimize  some  function  of  the  prediction 
error , 3 

t[t,9)  =  w-\q,9)[v(t)  -  P(q,8Mi))  (3-2°) 

>W( e  use  <  to  denote  tun  pie  time®,  i.e.,  t  =  0, 1,2,...,  etc;  q  is  the  shift  operator  where  }*(<)  *  *(t  +  1)  and 
q~lx(t)  =  x (t  -  1).  Also,  we  use  the  term  transfer  function  to  denote  an  operator  which  depends  on  q. 

*In  this  form  the  prediction  error  is  optimal  given  the  noise  assumptions,  i.e.,  the  prediction  error  is  the  “innova¬ 
tions*  sequence  corresponding  to  the  steady  state  Kalman  filter  tuned  to  the  model  and  noise  statistics.  If  nothing 
is  known  about  i /(<),  i.e.,  v(t)  is  bounded  but  otherwise  unpredictable,  then  the  predictor  has  the  property  that  as 
t  —  oo,e(«,0)  —  v(t). 
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3.4.3  Least-Squares  Criterion 
For  example,  the  least-squares  estimate  is 


6n  =  arg  min  V/sr(0) 


(3.21) 


(3.22) 


where  T  is  a  subset  of  Rp  which  consists  of  those  constant  6  which  stabilize  the  predictor,  and  L(q) 
is  a  stable  filter. 

Techniques  for  computing  Bjq  involve  either  iterative  or  recursive  algorithms,  see,  e.g.,  Ljung 
and  Sodestrom(1983),  Ljung(1987).  No  matter  how  it  is  computed,  it  can  be  shown  that  for  large 
N , 

On  w  arg  min  J  ^Qs(u,0)\P(eju/,  9)  -  du  (3.23) 


$*(„,#)  = 


(3.24) 


where  Yjv( w)  and  Un(u)  are  the  discrete  Fourier  transforms  of  u(t)  and  y(t),  respectively,  for 
t  =  1,. .. ,N.  Thus, 

v(>  )«•'"'  (3.25) 


and  the  same  for  Vn(u).  The  above  frequency  domain  expressions  support  the  interpretation 
that  P(eju,,0)  is  the  best  quadratic  fit  of  the  empirical  transfer  function  estimate  Yn(u)/Un(u>), 
under  the  weighting  function  Qjv(v,0).  Clearly  this  weight  can  be  effected  by  the  input  spectrum 
Un( <*>),  the  data  filter  L(e^u),  and  the  noise  model  W(e^w,0).  These  choices  can  be  used  to  design 
the  estimation  experiment  so  as  to  achieve  the  smallest  possible  model  error  as  needed  by  control 
design. 


3.4.4  Least-Peak  Criterion 


The  estimation  criterion  can  also  be  taken  as 

VN{0)=  sup  |L(g)e(t,0)|  (3.26) 

<€i . N 

Estimators  based  on  this  criterion  are  known  to  be  much  sharper  when  the  error  is  small,  i.e.,  they 
converge  more  rapidly  than  the  least-squares  estimate.  Unfortunately,  computational  techniques 
for  computing  this  estimate  have  not  been  fully  developed. 


3.4.5  Computing  Model  Error:  The  Mean-Square-Error  (MSE) 

Suppose  we  ha  ;  computed  0/v.  The  next  step  is  to  form  the  transfer  function  estimates  P(q,6\ r) 
and  W(q,0j<i).  It  is  important  when  using  these  estimates  in  control  design  to  be  confident  of 
their  accuracy  a  function  of  N,  the  data  record  length.  For  example,  if  lim/v—oo  0^  =  0.,  then 
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we  would  like  to  know  the  convergence  rate  as  well  as  the  properties  of  the  limiting  estimates 
with  respect  to  the  true  system  P0(q),W0(q).  A  knowlege  of  the  mean-square- 
error  (MSE)  between  the  estimated  transfer  function  and  the  true  system  transfer  function  would 
determine  the  requisite  robustness  of  control  that  is  required  to  use  the  transfer  function  estimate 
for  control  design. 

Expressions  for  the  MSE  are  in  general  quite  complicated.  It  is  shown  in  Ljung(1987),  under 
fairly  weak  conditions  on  the  model  set,  that  with  probability  one, 

lim  §N  =  9 ,  (3.27) 

N—*oo 

where 

(3.28) 

(3.29) 

Moreover,  the  quantity 

>/N(9n  -  9.)  (3.30) 

is  asymptotically  normal  with  zero  mean  and  covariance  matrix 

R.  =  [V'Wr1  Jim  £{N[Vj,(9.))TVMe.)}lV"(9.T1  (3.31) 

JV— » oo 

where  '  and  "  denote  differentiation  with  respect  to  9 ,  once  and  twice,  respectively.  Thus,  for  large 
N,  we  have 

.  •  -  '£{(9„  -  4.)(fo  -  O-f) «  jR.  (3.32) 

Hence,  the  MSE  for  model  error  is,  for  large  N: 

£{\ P(e*JN)  -  P0(^)|2}  *  | P(ei“,9.)  -  P0{^)\2  +  jP,(^,9.)RmP'(^y9.)T  (3.33) 

This  is  a  very  important  result,  because  it  provides  an  analysis  tool  for  computing  the  estimated 
transfer  function  accuracy  directly  in  terms  of  the  transfer  function  for  any  parametrization.  Hence, 
any  proposed  estimator  whose  purpose  is  to  asymptotically  produce  an  estimate  which  minimizes 
V(9),  can  be  evaluated  independently  from  how  the  estimate  is  produced.  In  fact,  the  term  above 
which  decays  as  1/N  is  the  best  that  can  be  achieved  by  any  on-line  or  off-line  algorithm. 


9m  =  argminV(0) 
m  =  £[£\t,9)} 


3.4.6  Non- Parametric  Methods  of  IVansfer  Function  Estimation 

Parametric  methods  as  we  have  discussed  them,  involve  transfer  function  models  which  depend 
on  either  physical  or  canonical  parameters  in  a  specified  way.  Methods  which  do  not  require  such 
structural  knowledge  or  assumptions  are  referred  to  as  non-parametric.  For  example,  suppose  that 
the  only  assumption  is  that  the  system  has  a  transfer  function  which  is  smooth  in  some  sense. 

Available  methods  for  non-parametric  transfer  function  estimation  rely  on  estimating  spectral 
densities.  The  methods  are  based  on  time-series  and  Fourier  analysis  of  finite  data  sequences.  There 
are  many  excellent  textbooks  on  the  subject,  e.g.,  Jenkins  and  Watts(1968),  Brillinger(1975),  and 
Priestly  (1981). 
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Just  as  in  the  parametric  case,  it  is  also  possible  to  obtain  expressions  for  the  MSE  for  model 
error.  A  typical  approximation  for  large  N  is 

£{\P(et")  -  P0(O|2}  *  M2(7)|*(u,)|2  +  jjL(i)Sdd(w)/ SUu(u)  (3.34) 

where  P(e^)  is  the  estimate  of  P0(e^)  obtained  using  spectral  estimation  techniques,  and  where 

R(u)  =  P'V")  +  Kie’nSM/Suui")  (3.35) 

with  '  and  "  here  denoting  differentiation  with  respect  to  w,  once  and  twice,  respectively.  Also 

M(7)  =  f2ru>2Wy(u)dw  (3.36) 

Hi)  =  flrW^du  (3.37) 

where  W7(w)  is  the  lag  window  of  width  l/7.  The  window  is  used  to  generate  “smooth”  spectral 
estimates  and  as  seen  in  the  above  expressions  can  be  used  to  adjust  the  MSE.  Typically,  as  7 
increases,  the  window  becomes  more  narrow,  M(j)  decreases,  and  L( 7)  increases.  Thus,  as  7 
increases,  the  first  term  (the  bias)  decreases,  but  the  second  term  (the  variance)  increases.  Clearly 
for  large  N  there  is  an  optimal  choice  of  lag  window  width  to  minimize  the  MSE  for  fixed  N,  and 
this  can  be  calculated;  see  Ljung  (1997). 

.  •* 

3.5  Application  to  LSS 

The  number  of  actual  physical  parameters  in  an  LSS  is  extremely  large.  Direct  parametrizations 
involving  these  parameters  is  out  of  the  question  from  a  practical  view.  For  example,  consider  the 
following  two  modeling  approaches:  (1)  finite  element  modeling,  and  (2)  continuum  modeling. 

3.5.1  Finite  Element  Modeling 

A  typical  finite  element  model  has  the  form 

M(6)q(t)  +  K(9)q(t)  =  Bu{t)  ,  y(t)  =  Cg(t)  (3.38) 

where  q(t)  €  R*  is  a  vector  of  nodal  displacements  (l  is  large),  and  y(t)  and  u(t)  are  the  output  and 
input  vectors,  respectively.  The  lx  l  matrices  M(0)  and  K(9)  are  the  mass  and  stiffness  matrices, 
respectively.  The  matrices  B  and  C  depend  only  on  the  location  of  the  point  actuators  and  point 
sensors  and  do  not  depend  on  the  parameter  vector  6  €  Rp.  The  elements  of  6  consist  of  various 
lengths,  areas,  mass  densities,  stiffness  values,  etc.  If  is  a  nominal  value,  then  for  small  parameter 
deviations,  the  mass  and  stiffness  matrices  can  be  approximated  by  [see,  e.g.,  McIntosh(1986)] 

M(9)  «  Mo  +  -  Oio)miMi 

i= 1 

K(0)  *  + 

1=1 

Suppose  that  the  matrices  : «  =  0,...,p}  and  integers  :  »  =  1,.  ..,p}  are  known. 

Then,  the  physical  parameters  can  be  taken  as  the  parameter  deviations  from  the  nominal,  i.e., 
{9i  -  $io  :  i  =  0,...,p) 


(3.39) 

(3.40) 
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3.5.2  Continuum  Modeling 


It  is  often  possible  to  approximate  a  complicated  structure  with  a  simpler  structure.  For  example, 
consider  the  parametric  continuum  model 

m(x,0)zu(t,z)  +  [p{x,0)zxx(t,x)]xx  =  /(t,x)  (3.41) 

where  here  we  take  m(x,0)  as  the  mass  density,  p(x,0)  as  the  elastic  modulus,  and  f(t,x)  as  the 
force  distribution  on  the  structure.  Observe  that  the  parameter  vector  6  is  used  to  adjust  the 
spatially  varying  mass  density  and  elastic  modulus.  In  our  simple  tortional  rod  example  we  have 
assumed  uniformity,  and  thus,  the  parametrization  is  given  by  m(x,0)  =  6\ ,  p(x,02)  =  02,  with 
the  parameter  vector  0T  =  (0i,02).  More  sophisticated  functions  will  undoubtedly  be  necessary  to 
model  LSS.  For  example,  cubic  spline  functions  are  examined  in  Banks  and  Crowley(1981,1982). 

,  .  i  ' 

3.5.3  Spatial  Measurements 

Consider  again  the  wave  equation  (3.5)  with  boundary  conditions  (3.6).  If  measurements  are 
available  of  r«(t,x)  and  zxx(t,x)  at  a  point  x  =  x0  on  the  structure,  then  a  natural  choice  for  the 
prediction  error  is 

£(t,0)  =  zu(t,x0)  -  02*xx(t,xo)  (3.42) 

Clearly  then,  estimation  of  the  physical  parameters  is  direct.  The  problem  is  that  the  assumed 
measurements  are  not  available.  These  could,  however,  be  estimated  from  position  and  rate  mea¬ 
surements  along  the  structure.  Suppose  we  have  measurements  of  z(t,x0),z(t,x0  ±  A),zt(t,x0), 
where  A  is  a  small  spatial  distance.  Then,  an  estimate  of  zu(t,  xQ)  is 

Xtt(tiX0)  —  Zt(t,  x0)  (3.43) 

and  an  estimate  of  zxx(f ,  x„)  is 

x**{i,Xo)  =  ^[z{t,x0  +  A)  -  2z(t,xa)  +  z(t,z0  -  A)]  (3.44) 

The  prediction  error,  based  on  measurements,  which  can  now  be  used  in  the  estimation  algorithm 
is 

c(t,0)  =  0iZu(t,xo)-  02zxx{t,xo)  (3.45) 

Thus,  the  use  of  spatial  measurements  can  simplify  the  physical  parametrization  of  transfer  function 
models.  We  remark  that  the  spatial  measurements  might  not  be  just  position  and  velocity  but  might 
involve  the  outputs  of  very  many  strain  gauges  placed  appropriately  on  the  structure. 


3.6  Uncertainty  Estimation 

The  basic  idea  is  to  postulate  a  parametric  model  with  two  types  of  parameters,  say  8ph  €  R,;  and 
9aux  €  It*.  The  IP*1  parameters  correspond  to,  say,  physical  parameters  in  the  usual  sense  of  model 
building,  but  the  0aur  parameters  are  auxiliary  and  used  to  account  for  poorly  known  aspects  of 
the  system  dynamics.  To  illustrate  the  point  further,  let  the  parametric  transfer  function  model 
P(q,0)  have  the  decomposition 


P(q,0)  =  P(q,<rh)P(q,0"*) 


(3.46) 


Figure  3.1:  A  generic  parametric  model  structure 


where  P(q,9vh)  is  the  usual  parametric  model  and  where  P(q,0aux)  is  a  simple  parametric  structure 
which  accounts  for  unmodeled  dynamics. 

Obviously  more  elaborate  forms  can  be  established  depending  on  the  kind  of  a  priori  knowl¬ 
edge  available  regarding  the  location  and  type  of  unmodeled  or  poorly  known  dynamics.  A  pro¬ 
posed  generic  form  is  shown  in  Figure  3.1,  where  G(q)  is  a  known  interconnection  transfer  matrix, 
K(q,6ph)  contains  all  the  known  parametric  stucture,  and  L(q,0aux)  represents  the  poorly  known 
dynamics.  The  true  plant  would  be  represented  by  the  corresponding  triple  G0(q),  K0(q),L0(q). 

This  model  is  useful  for  control  design  if  the  parameters  0ph  and  9aux  can  be  separately  calculated 
as  in  Kosut(  1987a, b).  Then  a  non-parametric  method  can  be  used  to  estimate  the  model  error 
between  the  estimated  system  G(q),K(q,0ph),L(q,0aux)  and  the  true  system  G(q),K0(q),L0(q). 
Moreover,  the  variance  of  parameter  estimates  can  be  utilized  in  assessing  the  structure  of  parameter 
uncertainty.  When  this  procedure  is  coupled  together  with  the  previously  discussed  controller 
sysnthesis  methods  ,  we  then  have  an  on-line  robust  control  design  scheme,  what  is  referred  to  as 
adaptive  calibration. 


3.7  Computing  the  Estimate 


In  general,  adaptive  algorithms  share  a  similar  genesis  and  purpose,  namely  to  adjust  a  parameter 
vector  0(t)  to  asymptotically  approach  a  member  of  the  set 

fopt  =  {*  €  Rp  :  9  =  arg  min  J(9)}  (3.47) 

9 


where 


J{9)  =  avg{f2(t,0)} 


(3.48) 
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(3.49) 


with  the  averaging  operator  defined  by 

avg{*(t)}  =  lim  i  fTX(t)dt 

T  — *oo  1  Jo 

The  adaptation  error  s(f,0)  is  formed  with  0(f)  held  constant  at  0.  This  error  signal  can  be  the 
prediction  error  as  described  before  in  (3.20),  or  can  be  obtained  from  the  adaptive  feedback  system 
in  Figure  2.3  as  the  tracking  error 

£(f,0)  =  y(f)|^)=tf-r(f)  (3.50) 

In  the  ideal  case,  the  parametrization  can  be  selected  so  as  to  achieve  what  is  called  perfect 
matching ,  i.e.,  for  some  restricted  class  of  exogenous  inputs  (usually  no  disturbances),  and  a  suffi¬ 
cient  number  of  parameters  (no  unmodeled  dynamics),  the  set  has  a  single  member  such  that 
lim<_oo  s(f ,  0)  — *  0  (or  white  noise  in  the  stochastic  case).  This  is  usually  never  the  case. 

Under  more  realistic  conditions,  the  best  that  can  happen  [see,  e.g.,  Anderson  et  al.(1986)]  is 
that  0(f)  asymptotically  approaches  a  small  neighborhood  of  a  local  minima  of  7(0),  i.e.,  the  set 

r.  -  {0  €  R”  :  ^7(0)  =  0,  —7(0)  >  0}  (3.51) 

3.7.1  Gradient  Algorithms 

For  the  purposes  of  illustration  we  will  assume  that  parameter  adjustments  are  continuous.  In 
practice  the  adjustments  are  at  discrete  times.  Then,  assuming  continuous  adjustments,  one  method 
to  achieve  0(f)  — *  Fopt  is  to  adjust  0(f)  in  the  negative  gradient  direction  of  the  instantaneous 
squared  error,  that  is,  let 

(3.52) 

Hence,  the  parameter  adaptive  algorithm  is 

6(t)  =  (3.53) 

where  7  is  a  positive  constant,  and 

c(f)  —  £{t, 
t&(t)  =  V>(f,0)|,=*-(t) 

with  the  error  gradient 

3.7.2  Gauss-Newton  Algorithm 
If  the  parameter  vector  is  adjusted  to  account  for  second  order  effects,  then  the  resulting  algorithm 

1(f)  =  R~\t)m£(t)  (3-57) 

where 

R(t)  =  0(f)0T(f)  (3.58) 

This  algorithm  has  the  same  form  as  the  recursive-least- squares  (RLS)  algorithm.  The  matrix  R{t) 
is  approximately  the  inverse  of  the  covariance  of  the  parameter  error. 


(3.54) 

(3.55) 

(3.56) 
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3.7.3  Constructing  the  Error  Gradient  from  Measured  Signals 

A  difficulty  with  constructing  the  above  gradient  algorithm  is  that  the  error  gradient  0(1,0)  is  a 
function  of  the  true,  but  unknown  plant.  The  actual  algorithm  uses  an  approximation  0(1,0),  and 
thus 

0(1)  =  70(1)£(<)  (3.59) 

where  0(1),  referred  to  as  the  regressor,  is  given  by 

0{1)  =  0(1,0)|^(1)  (3.60) 


For  example,  consider  the  wave  model  of  the  uniform  rod  in  (3.8),  which  can  also  be  expressed 
as 

<*[y(0  -  V(t  ~  2/3)]  =  u(l)  +  u(l  -  2/3)  (3.61) 

The  parameter  vector  16  0T  —  (a  ,  /?).  Let  the  adaptive  error  signal  be  the  equation  error 

e(t,  0)  =  o[y(l)  -  y(t  -  2/3)]  -  u(l)  -  u(l  -  2/3)  (3.62) 


The  negative  gradient  of  the  error  is  then 


0(1,0) 


(  ~ea(t,9)  )_(  y(l-2/3)-y(l) 

\  \  -2[ay(l  -  2/3)  +  ti(!  -  2/3)] 


(3.63) 


If  2/(1)  and  u(l)  are  not  available,  then  the  actual  gradient  0(1,0)  can  be  approximated  by 


0(t  e)  - 1  "i  - 1  y{tr 2 ~ y(<) 

]  ~  V  -«/»(*.»)  )~\-  fe)  [«»(i  -  2/3)  +  u(l  -  2)8)] 
where  the  filter  time  constant  r  is  a  design  choice. 


(3.64) 


3.7.4  Algorithm  Modifications 

To  insure  that  adaptive  algorithms  are  working,  one  often  includes  various  “safety  nets”  or  AI  fea¬ 
tures.  For  example,  in  uncertainty  estimation,  the  controller  is  not  changed  until  some  level  of  con¬ 
fidence  is  established.  Other  AI  features  include  monitoring  of  information,  covariance  monitoring, 
perisitent  excitation  cpnditioning,  and  parameter  projection,  to  name  a  few.  Various  normalizations 
of  the  measured  sugnals  also  enhance  the  ability  of  adaptive  systems  to  insure  bounded  parame¬ 
ter  values.  These  latter  approaches  need  prior  information  regarding  model  error  and  parameter 
ranges. 

As  an  example,  consider  the  Gauss-Newton  algorithm  with  “covariance”  resetting.  That  is,  set 

R(ti)  t-.Ro  =  rZ>0  (3.65) 

where  the  reset  times  {!,}  are  found  frotr 

rt,~i+S  _ 

ti  =  lj_i  +  min{0  :  /  0(l)0~(l)dl  >  a}  (3.66) 

Jti-i 

where  to  =  0.  These  reset  times  are  essentially  those  where  sufficient  information  is  gathered. 
Usually  the  controller  is  adjusted  only  at  these  times,  i.e.,  controller  adjustment  takes  place  when 
the  model  estimate  is  good. 
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3.7.5  Performance  of  the  Algorithm 


Despite  the  above  approximation,  from  a  practical  point  of  view  it  is  certainly  acceptable  that  the 
parameters  approach  and  remain  in  a  small  neighborhood  of  provided  that  members  of  this  6et 
also  produce  acceptable  performance. 

Assuming  this  is  so,  let  0.  €  Rp  denote  such  a  setting,  of  which  there  could  be  many.  We  refer 
to  each  0,  as  a  tuned  parameter  and  to  the  corresponding  feedback  system  as  the  tuned  system  [see, 
e.g.,  Kosut  and  Friedlander(1985)].  Clearly  the  tuned  system  is  the  feedback  system  above  with 
0(t)  fixed  at  0..  We  can  now  pose  the  following  questions  regarding  the  adaptive  system  : 

1.  How  do  the  tuned  parameters  depend  on  the  exogenous  inputs? 

2.  Is  the  adaptive  system  6table  in  a  neighborhood  of  the  tuned  system.  Furthermore,  how  small 
is  this  neighborhood? 

3.  What  is  the  region  of  attraction  to  a  small  neighborhood  of  the  tuned  system. 

4.  What  is  the  rate  of  convergence  to  this  small  neighborhood. 

Some  of  these  questions  can  be  answered  by  the  method  of  averaging  which  is  discussed  in  the 
following  section. 


3.8  Averaging  Analysis  of  Stability  and  Convergence 

3.8.1  Method  of  Averaging 

The  classical  method  'of  averaging  applies  to  a  differential  equation  of  the  form 

*  =  7/(*,*) ,  7  >  0  (3.67) 

where  7  is  a  positive  constant.  If  7  is  sufficiently  small  then  under  suitable  regularity  conditions  on 
/(t,  x),  the  stability  of  the  above  time- varying  system  is  inherited  from  the  stability  of  the  simpler 
autonomous  system 

i  =  7  /«(*)  (3.68) 

where 

/«(*)  =  avg{/(*,x)}  (3.69) 

This  system  is  referred  to  as  the  averaged  system.  Observe  that  for  fa{x)  to  exist,  some  other 
restrictions  must  also  apply  to  f(t,x),  e.g.,  f(t,x)  is  almost  periodic  in  t  uniformly  for  x  in  a 
compact  set;  although  weaker  conditions  can  be  stated. 

The  above  type  of  result  is  certainly  expected  on  intuitive  grounds,  provided  that  /(t,x)  is 
smooth  enough  and  the  average  value  /a(x)  exists.  A  precise  formulation  of  the  stability-instability 
conditions  is  given  in  Theorem  V.3.1,  Hale(1969). 

In  the  adaptive  system  we  examine  the  “average”  parameter  trajectories  ensuing  from 

0«  =  7/(«.)  (3.70) 
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(3.71) 


where  the  adaptive  parameter  vector  field  is  given  by 

f(6)  =  avg{4>(t,0)£(t,0)} 

The  question  is  then  to  determine  conditions  under  which  the  behavior  of  the  actual  parameter 
trajer  ories  is  like  that  of  the  average  parameter  trajectories.  Using  the  above  averaging  theory, 
analysis  of  the  adaptive  system  can  be  divided  into  answering  questions  about  its  asymptotic  and 
transient  characteristics. 

3.8.2  Asymptotic  Analysis 

The  definition  of  the  tuned  system  as  well  as  the  stability  of  the  adaptive  system  in  the  neighborhood 
of  the  tuned  system  can  be  answered  by  Theorem  4.2  in  Anderson  et  al.(1986)  or  Section  3  in  Bodson 
et  al.(1985).  The  flavor  of  these  results  can  be  stated  as  follows: 

Theorem  1  Let  the  tuned  parameter  set  be  defined  as  those  0.  £  Rp  which  stabilize  the  feedback 
system  and  which  also  satisfy 

avg{/(M.)}  =  0  (3-72) 

If  the  functions  f(t,6),<t>(t,0)  ande(t,6)  are  sufficiently  smooth,  almost  periodic  in  t  uniformly  for 
0  in  compact  sets,  then  for  a  sufficiently  small  adaptation  gain  7,  and  a  sufficiently  small  peak 
value  of  the  tuned  error  signal  e(t,0m),  solutions  of  the  adaptive  system  originating  in  a  small 
neighborhood  of  the  tuned  solutidns  0.,<f>(i,0.),  and  e(t,0.),  will  remain  there  if 

maxReAj[i?(0,)]  <  0  (3.73) 

t 

and,  moreover,  will  leave  there  if 

maxReAi[J9(0»)]  >  0  (3.74) 

» 

where  the  matrix  function  0  >-►  B(0)  is  given  by 

B(0)  =  ^avg{/(«,0)}  (3.75) 

In  addition,  if  the  tuned  solution  is  stable,  that  is,  if  max,  Re  A,  [5(0, )]  <  0,  then 

limsup  ||0(t)  -  0.||  =  0(limsup  je.(t)|)  + O(t)  (3-76) 

limsup(|r7(t)U  =  0(  7)  (3.77) 

<— >00 


The  type  of  smoothness  conditions  required  of  the  functions  f(t,0),$(t,$),  and  £(t,  0)  is  not 
severe,  e.g.,  continuity  in  t  and  Lipschitz  continuity  in  6  for  0  in  compact  sets. 

The  sharp  stability-instability  boundary  expressed  above  allows  not  only  for  an  assessment  of 
a  particular  design,  but  also  indicates  how  to  modify  and  improve  the  algorithm. 

3.8.3  Transient  Analysis 

An  understanding  of  the  transient  properties  of  the  adaptive  system  requires  answering  the  ques¬ 
tions  posed  before,  namely,  determining  the  region  of  attraction  to  a  small  neighborhood  of  the 
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tuned  system,  and  the  rate  of  convergence.  The  folowing  result,  based  on  averaging,  can  be  found 
in  Anderson  et  al.(1986). 

Theorem  2  Under  the  conditions  stated  in  Theorem  1,  if  the  initial  parameter  value  60  is 
strictly  inside  a  convex  subset  of  the  constant  parameter  stability  set,  then  6(t)  will  remain  in  that 
set  and  converge  exponentially  at  a  rate  no  slower  than  0( 7)  to  the  small  neighborhood  of  the  tuned 
system  defined  in  Theorem  1. 

Some  of  the  restrictions  can  be  relaxed,  see,  e.g.,  Riedle  and  Kokotovic(1986)  or  Bodson  et 
al.(1985).  Specifically,  the  region  of  attraction  can  be  relaxed  to  a  compact  subset  of  the  constant 
parameter  stability  set;  also,  estimates  of  the  rate  of  convergence  far  from  the  tuned  setting  do  not 
have  to  be  0(7). 

These  type  of  results  are  pleasing  from  an  intuitive  point  of  view,  thus  providing  a  qualitative 
analysis.  However,  they  do  not  apply  when  the  parameters  leave  the  constant  parameter  stability 
set,  nor  do  they  provide  quantitative  performance  measures.  In  addition,  we  still  require  slow 
adaptation,  which  in  itself  is  not  too  objectionable,  perhaps  only  cautious,  but  again  specific  bounds 
on  the  adaption  gain  extracted  from  the  theory  tend  to  be  extremely  conservative. 
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Adaptive  Control  of  Large  Space  Structures: 

Uncertainty  Estimation  and  Robust  Control  Calibration1 

Robert  L.  Kosut  2 
Abstract 

An  approach  is  presented  to  the  problem  of  designing  a  robust  control  using  on-line  mea¬ 
surements.  The  idea  is  to  use  standard  methods  of  parametric  system  identification  to  obtain 
a  nominal  estimate  of  the  plant  transfer  function.  Non-parametric  spectral  methods  are  then 
used  to  obtain  a  frequency  domian  expression  for  model  uncertainty.  If  the  model  uncertainty 
exceeds  a  specified  frequency  bound,  which  has  been  predetermined  from  the  nominal  model 
and  the  performance  criteria,  then  data  filters  used  in  the  system  identification  are  modified 
and  the  procedure  is  repeated.  An  analysis  is  presented  which  establishes  conditions  under 
which  the  procedure  will  actually  converge  to  a  satisfactory  robust  design.  An  example  is 
provided  which  illustrates  the  method  and  supporting  analysis. 

1  Introduction 

Large  space  structure  (LSS)  systems  impose  stringent  performance  demands,  and  hence,  feedback 
designs  will  of  necessity  be  based  on  very  accurate  models.  However,  the  on-orbit  dynamics 
will  not  be  sufficiently  like  those  obtained  from  either  ground- testing  or  even  from  sophisticated 
computer  generated  modeling  techniques,  e.g.,  finite  element  modeling.  Therefore,  it  is  necessary 
to  be  able  to  identify  che  LSS  dynamics  directly  from  on-orbit  measurements,  and  simultaneously, 
time  or  re-design  the  control.  Hence,  the  control  design  cycle  is  an  adaptive  process,  typically 
starting  with  a  nominal  low-performance  design  based  on  a  coarse  model,  and  then  re-designed 
from  on-orbit  data. 

This  paper  addresses  some  of  the  issues  involved  in  the  adaptive  control  of  LSS  systems, 
specifically  the  problem  of  vibration  suppression. 

1.1  Adaptive  Control 

Adaptive  control,  as  depicted  in  Figure  1,  essentially  consists  of  two  processes,  namely:  (1)  a 
model  parameter  estimator  which  uses  a  finite  data  record  of  input-output  measurements,  and 
(2)  a  control  design  rule  which  transforms  the  model  parameter  estimates  into  control  parameters. 

The  procedure  can  go  awry  mainly  because  the  model  estimate  is  not  sufficiently  accurate  and 
the  control  design  rule  effectively  assumes  that  the  estimates  are  perfectly  accurate.  Although 
the  procedure  may  work  well  whenever  the  parameter  estimates  are  in  a  good  region,  they  may 
never  get  there. 

One  route  around  this  problem  is  to  prove  that  it  can  never  occur,  which  involves  analyzing  the 
complete  adaptive  system.  For  example,  analysis  techniques  based  on  the  method  of  averaging 
can  provide  an  assessment  of  the  adaptive  systems  robustness,  e.g.,  Anderson  et  al.(1986). 

‘Research  supported  by  AFOSR,  Directorate  of  Aerospace  Sciences,  under  contract  F49620-85-C-0094. 

JR.L.  Kosut  is  a  Senior  Scientist  with  Integrated  Systems,  Inc.,  2500  Mission  College  Bivd.,  Santa  Clara,  CA 
95054,  and  a  Consulting  Professor  with  the  Information  Systems  Lab,  Stanford  U..iversity,  Stanford,  CA  95305. 
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Figure  1:  Adaptive  Control 


Another  route,  explored  here,  is  to  modify  the  process  so  that  along  with  estimating  model 
parameters,  a  measure  of  model  uncertainty  is  also  obtained.  With  such  an  estimate  in  hand 
the  control  design  rule  can  be  modified  to  account  for  the.  model  uncertainty.  If  the  resulting 
controller  is  too  cautious,  because  the  model  estimate  is  too  uncertain,  then  it  is  necessary  to 
repeat  the  estimation  procedure  so  as  to  obtain  a  better  model. 

In  this  paper  we  will  concentrate  on  a  particular  procedure  for  obtaining  a  measure  of  model 
uncertainty  for  linear  time  invariant  plants.  The  process  is  referred  to  as  adaptive  uncertainty 
modeling  and  the  outcome  is  a  frequency  domain  expression  for  the  model  uncertainty.  The  mo¬ 
tivation  arises  from  robust  control  theory  which  utilizes  frequency  domain  expressions  for  char¬ 
acterizing  a  set  of  uncertainty  within  which  lies  the  true  plant,  e.g.,  Francis  and  Zames(19S4), 
Safonov  et  al.(1981),  Doyle  and  Chu(19S6),  Vidyasagar(1985).  An  adaptive  control  system,  mod¬ 
ified  so  that  it  produces  a  set  of  uncertainty,  is  depicted  in  Figure  2. 
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Figure  2:  Adaptive  Control  with  Uncertainty  Estimator 


The  indicated  adaptive  process  is  referred  to  here  as  adaptive  calibration,  Kosut(19S6,1987). 
The  set  of  uncertainty  is  extracted  from  input-output  measurements.  We  will  describe  one  ap¬ 
proach  to  this  problem  which  uses  a  combination  of  parametric  prediction  error  methods  together 


with  standard  non-parametric  (spectral)  estimation  methods.  We  then  show  how  the  set  of  uncer¬ 
tainty  produced  by  the  modified  identifier  allows  for  performance  evaluation  before  the  controller 
is  adjusted.  If  the  predicted  performance  is  not  satisfactory  then  the  identification  process  is 
repeated  under  different  conditions  which  will  reduce  model  uncertainty  where  needed.  The  re¬ 
sulting  controller  is  designed  to  be  robust  with  respect  to  the  estimated  set  of  uncertainty.  Thus, 
a  large  set  of  uncertainty  requires  a  cautious  or  low  authority  controller,  whereas  a  small  set  of 
uncertainty  will  result  in  a  high  authority  controller.  Observe  that  the  order  of  the  controller  can 
vary  with  the  set  of  uncertainty. 

This  paper  is  organized  as  follows:  Section  2  develops  the  sampled-data  linear  transfer  function 
model.  Section  3  states  some  known  results  regarding  stabilization  and  robustness  of  linear 
control  systems.  Section  4  is  the  main  section,  decribing  in  detail  an  approach  to  estimating 
the  set  of  uncertainty  and  a  procedure  for  adaptive  calibration  for  both  open-loop  and  closed- 
loop  situations.  Section  5  presents  an  example  of  uncertainty  estimation  from  experimental  data. 
Section  6  contains  concluding  remarks. 


2  Linear  Flexible  Dynamics 


2.1  Transfer  Function  Model 


Under  the  assumptions  of  linear  elasticity  and  small  deflections,  at  any  time  t  and  position  r  on 
the  structure,  the  deflections  are  given  by 

v(*»r)  =  I3»I*(OlMO  (1) 

k*0 

where  xpk(r)  is  the  kth  mode  shape  at  position  r,  and  rjk(t)  is  the  kth  modal  amplitude  as  a 
function  of  time.  The  upper  limit  /  in  the  above  sum  is  theoretically  infinite,  but  for  all  practical 
purposes  can  be  considered  here  to  be  finite,  but  extremely  large.  Assuming  there  are  point 
actuators  located  at  the  discrete  positions  r«i,. . . ,rom>,  then  the  modal  amplitudes  each  satisfy 

771 0 

tf*(0  +  2C*ft*7*(0  +  fi*»?*(0:=5Z«.(0V,k(**««),  1  =  1,2,...,/  (2) 

is] 


where  flk  and  £*  are  the  kth  modal  frequency  and  damping,  respectively,  and  where  u,(<)  is  the 
ith  actuator  force  or  torque  on  the  structure.  A  position  sensor  located  at  r  would  measure 

Vp(*.r)  =  ]Cflfc(OlMr)  (3) 

fc=0 

Likewise,  a  velocity  sensor  at  r  would  measure 

Vv(*,r)  =  (4) 

*s0 


Suppose  that  actuators  are  placed  at  r0i, . . .  ,rom.,  position  sensors  at  rpi, . . . , rpm,,  and  velocity 
sensors  at  rVil, . . . ,  ru>m..  Then,  the  transfer  function  from  the  ith  applied  force  to  the  jth  position 
or  velocity  sensor  output  is  then,  respectively 


-  'EMrr})Mr.i)p  +  + 


(5) 

(6) 

(7) 


For  purposes  of  illustration,  suppose  that  there  is  only  one  position  sensor  at  rp  and  one  actuator, 
not  located  at  the  same  position  as  the  sensor,  but  at  r0.  Let  y(t)  denote  the  output  of  the 
position  sensor.  Since  disturbance  forces  act  on  the  structure  and  the  sensors  have  some  noise 
component,  the  actual  sensor  output  will  read  3 


y(t)  =  G(s)u(t)  +  d(t) 


(8) 


where  d(t )  is  the  cumulative  effect  of  disturbances  and  noise  sources  as  seen  at  the  output,  and 
where  G(s)  is  the  transfer  function  from  actuator  to  sensor  location.  Thus, 


where 


_ 0k _ 

sJ  +  2C  kilks  + 


0k  =  t/>*(*>)^*(r«) 


(9) 

(10) 


2.2  Sampled-Data  Representation 


Assume  that  control  commands  and  data  acqusition  occur  at  discrete  sampling  instances  separated 
by  uniform  sampling  intervals  of  duration  tta  =  l/Cl,a  seconds,  where  Sl,a  is  the  sampling  frequency 
in  Hz.  Thus,  u(t)  is  constant  between  samples,  i.e.,  for  any  integer  Jb, 

u(t)  =  «(<*),  t  €  [t *,<*+,)  (11) 

where  tk  =  kt,a.  If,  in  addition,  the  sensor  output  is  sampled  at  the  same  instances,  then  the 
discrete-time  representation  of  (8)  is 


y{tk)  =  Po(q)u(tk)  +  d(tk) 


(12) 


where  Po(q)  is  the  zero-order  hold  equivalent  of  G(s),  expressed  as  a  function  of  the  shift  operator 
q  defined  by  qx(tk)  =  x(tk+i)  and  q~lx(tk)  =  Thus,  in  terms  of  the  usual  “z-transform” 

operator  Z(-),  we  have 

Po(?)  =  (1-9-1)2{F(s)/s)  (13) 


Using  (9)  gives 


Po(g)  =  E(1~9‘1)2{7 


0k 


k=  1 


3  S2  +  2 +  U2k 


) 


(14) 


In  order  to  simplify  the  notation  in  the  sequel,  time  will  be  normalized  with  respect  to  the 
sampling  instants.  That  is,  the  discrete-time  representation  of  the  LSS  will  be  written  compactly 
as 

y(0  =  -Po(gMf)  +  d(t)  (15) 

where  t  takes  on  integer  values  only,  i.e.,  t  =  1,2,...,  and  the  shift  operator  is  normalized  so 
that  qx(t )  =  x(t  +  1)  and  g-1x(<)  =  x(f  —  1).  We  will  take  (15)  as  the  true  LSS  system  and  the 
measured  data  record  as 

zN  =  =  l . N)  (16) 

3The  variable  s  is  used  to  denote  either  the  Laplace  transform  variable  or  the  dHTerenti&i  operator  d/dl,  depending 
on  the  context. 


2.3  Disturbance  Spectrum 


Assume  that  d(t)  is  a  zero-mean  sequence  with  spectral  density  S,u(u>),  where  u>  is  normalized 
frequency  in  the  interval  {— tt,  -jt].  The  spectral  density  is  defined  as  the  Fourier  transform  of  the 
auto- correlation  function  JLm(t),  that  is 

Sdd(u)  =  f;  Rdd(r)e-*~  (17) 

T=  — OO 

where,  following  Ljung(1987), 

+  (18) 

where  £{•}  is  the  usual  expectation  operator.  Observe  that  if  d(t)  is  either  a  stationary  stochastic 
process  or  a  periodic  deterministic  sequence,  then  the  auto-correlation  function  as  defined  above 
will  exist.  Such  sequences  are  said  to  be  quasi- stationary. 

A  convenient  representation  of  the  disturbance  is  that  it  is  the  output  of  a  stable  system  with 
transfer  function  H0(q)  and  input  w(t),  a  zero-mean  sequence  with  constant  spectrum  Swui(u>)  = 
ctq,  e.g.,  white  noise.  Hence,  the  auto-correlation  is  an  impulse  function,  and  furthermore 

S^)  =  <T2|ifo(eiw)|2  (19) 

3  Linear  Control  Design 

3.1  Stabilization  and  Robustness 

Before  designing  an  adaptive  control  we  first  consider  the  problem  of  designing  a  robust  non- 
adaptive  control  where  the  following  information  is  assumed  to  be  known:  (i)  a  nominal  model 
of  the  LSS,  and  (ii)  a  description  of  the  accuracy  of  the  model,  e.g.,  model  error  as  a  function  of 
frequency.  In  this  section  we  state  some  known  results  regarding  the  stabilization  and  robustness 
properties  of  feedback  systems  using  the  above  information. 

Consider  the  linear  scalar  feedback  system  depicted  in  Figure  3,  with  reference  input  r(t), 
disturbance  d(t),  control  actuator  input  u(t),  and  sensor  output  y(t).  The  transfer  functions  of 
the  plant  and  feedback  compensator  are  P(q)  and  C(q),  respectively. 


Figure  3:  Linear  Feedback  System 

Following  Vidyasagar(1985),  let  S  denote  the  set  of  transfer  functions  which  are  rational  in 
q -I  (ratios  of  polynomials  in  q~*  with  real  coefficients)  and  stable  (all  poles  strictly  inside  the  unit 


disc).  We  say  that  C  stabilizes  P  if  and  only  if  the  three  transfer  functions  C/(1+PC),  1/(1+PC), 
and  P/(  1  +  PC)  are  in  S.  We  now  state  the  following  known  result. 

Theorem  1 

(i)  Stabilization 

If  P  £  S  then  C  stabilizes  P  if  and  only  if  C  =  Q(1  -  PQ)~l  for  some  Q  £  S. 

(ii)  Robustness 

If 

P  =  P  +  A  (20) 

and  if  some  compensator  C  stabilizes  P,  then  it  also  stabilizes  P  if  and  only  if 

A(1  +  A Q)~'  £  S  (21) 

where  Q  =  C(  1  +  PC)~X  £  S. 

The  first  part  of  the  theorem  essentially  asserts  that  if  P  is  stable  then  the  set  of  all  rational 
compensators  that  stabilize  P,  denoted  by  S(P),  is  given  by 

S(P)  =  {Q(  1  -  PQ)~'  :Q£  S}  (22) 

Thus,  any  stabilizing  controller  must  produce  a  Q  which  is  in  S. 

The  second  part  of  the  theorem  provides  conditions  under  which  the  feedback  system  consisting 
of  the  nominal  plant  P  and  the  stabilizing  control  C,  which  can  be  thought  of  as  the  nominal 
closed-loop  system,  is  robustly  stable  to  a  dynamical  plant  perturbation  A.  Observe  that  in  this 
part  of  the  theorem,  both  the  nominal  plant  model  P  and  the  perturbation  A,  may  be  unstable. 
However,  when  A  £  S,  condition  (21)  reduces  to 


(1  +  A Q)-1  £  S 


a  sufficient  condition  for  which  is  that 


|A(e’")l  £  Vw  e  ■'’•’'I  (24) 

Clearly  then,  a  robust  control  can  be  designed  on  the  basis  of  a  nominal  transfer  function  model 
P(q)  and  an  upper  bound  on  the  model  error  as  a  function  of  frequency. 

It  is  convenient  to  define  the  set  of  uncertainty 

P(P,6)  =  {P(q)  £  S  :  |P(e^)  -  P(e>-)|  <  6(w),  Vu  £  [-*,*]}  (25) 

Thus,  for  any  function  6(u>)  >  jA(e}U)j,  we  have  P  £  P(P,  6).  Consequently,  any  control  C  which 
is  designed  to  stabilize  P  and  simultaneously  satisfy  S(u>)  <  l/|Q(e,w)|,  will  also  stabilize  the  true 
plant  P  =  P  +  A. 

3.2  Modal  Truncation 

Assume  that  the  transfer  function  estimate  P(q)  is  an  estimate  of  the  first  n  modes  of  the  LSS. 
Typically  n  is  not  too  large,  say  n  «  20,  and  of  course  n<f  Hence,  P(g)  is  an  estimate  of 


p.(?)  D1  9  +  + 


(26) 


(27) 


It  is  convenient  to  express  P,(q)  as  the  rational  function 


where 


ft(ff)  = 


B{qX) 

A(q,0.) 


A(q,6.)  =  l  +  ajg-1  + - I"a2n9_2n 

B(q,e.)  =  Kq-'  +  ...  +  b;nq-2n  (28) 

*.T  =  [«:•••  •  •  •  *>;„] 

The  transfer  function  coefficients  €  R4n  are  uniquely  determined  by  the  modal  parameters 
{f lk,PkXk  '•  fc  =  The  converse  is  not  true  unless  the  sampling  frequency  is  large 

enough,  i.e.,  if  7r  >  f2n/fi<a. 

By  analogy  with  the  above  desription  of  P.(q),  the  estimate  P(q)  can  be  expressed  as 


A,)- 2!*$ 

Mi,*) 

where  9  £  R4n  is  an  estimate  of  9..  Model  error  is  then 

A(q)  =  P.(q)-P(q)  +  P(q) 

where  P(q)  is  the  residual  mode  transfer  function  given  by 

fik 


Ht)~ 

k=n+l 


siJ  +  2  CiftfcS  +  fl* 


} 


(29) 


(30) 


(31) 


If  9  «  9W,  then  the  n-mode  estimate  P(q)  is  close  to  P„(g),  and  A (q)  «  P(g).  Thus,  a 
compensator  C(q)  which  stabilizes  P(q)  will  also  stabilize  the  true  system  Po{q)  provided  that 

|<?(e^)P(e^)|  <  I,  Vo;  €(-»,»)  (32) 

To  simplify  the  discussion  to  follow  suppose  that  the  sampling  frequency  Usa  is  sufficienly  large  in 
the  sense  that  <C  if.  Otherwise  anti-  aliasing  filters  have  to  be  included,  which  although 

they  may  ultimately  be  required,  the  discussion  here  would  be  obscured.  In  effect,  we  are  choosing 
an  arbitrarily  large  sampling  frequency. 

Now,  for  <  u;  <  7r,  the  control  can  be  designed  so  that  \Q(e*w)\  is  as  small  as  required. 

For  u>  <C  the  residual  modes  have  little  effect,  i.e.,  |P(e,w)|  is  small.  For  frequencies  near 

fln/0,o,  some  care  is  required  in  the  control  design.  However,  if  the  flexible  structure  is  crafted 
so  that  the  frequency  gap  —  ft„j  is  large,  then  most  any  controller  which  has  a  reasonable 

attenuation  for  frequencies  beyond  is  likely  to  be  robust  with  respect  to  the  unmodeled 

residual  modes.  This  latter  approach  is  preferable  even  though  the  controller  can  be  designed 
independently  from  the  structure. 

If  9  is  not  a  good  estimate  of  9m,  then  parameter  error  will  be  a  luge  contributor  to  model 
error  magnitude  |A(e**’)|.  In  this  case,  the  uncertainty  set  P(P,  |A|)  is  too  coarse  of  a  description, 
because  the  source  of  uncertainty  is  parametric  rather  than  unmodeled  high  frequency  dynamics. 
Thus,  any  compensator  designed  to  robustly  stabilize  this  set  of  uncertainty  will  be  unnecessarily 
cautious.  However,  the  resulting  closed  loop  system  may  still  satisfy  the  performance  goals.  As 
will  be  discussed  later,  it  is  possible  to  design  the  identification  experiment  to  preclude  large 
parameter  errors. 


4  Estimating  the  Set  of  Uncertainty 


The  problem  examined  in  this  section  is  to  estimate  a  set  of  uncertainty  P(P,  6)  from  the  data 
record  {y(t),  u(t)  :  t  —  1,.. .  ,N).  We  will  describe  one  approach  and  show  under  what  conditions 
the  true  plant  belongs  to  the  estimated  set  of  uncertainty.  In  this  case  it  follows  that  any  contol 
designed  to  robustly  stabilize  any  plant  in  the  estimated  set  of  uncertainty,  will  also  stabilize  the 
true  plant. 

4.1  Parametric  Model  Estimation 

We  begin  by  first  constructing  a  parametric  estimate  of  the  transfer  function  P0(g )  using  the 
prediction  error  formulation  in  Ljung(l985,1987).  Thus,  the  parameter  estimator  is  given  by 

0a  —  argmin  VV0,  a) 

tf€Z> 


VMa)  = 

■‘v  (=] 


where  V  is  a  subset  of  Rp  and  a  6  A  is  a  set  of  auxiliary  parameters  which  characterizes  a  stable 
filter  La(q),  where  A  is  a  subset  of  Rm.  The  purpose  of  the  auxiliary  parameter  set  will  be 
desribed  in  the  sequel.  The  sequence  e(t,6)  is  referred  to  as  the  prediction  error  and  La(q)e(t,  6) 
as  the  filtered  prediction  error.  I'xcept  for  the  auxiliary  parameters  a,  the  estimator  is  the  usual 
least  squares  estimator. 

Following  Ljung(  1985,1987),  the  prediction  error  is  obtained  from  the  parametric  model  set 

»(<)  =  P(q, *M0  +  H(q, 0)e(t),  0 even’  (34) 

where  e(t)  ia  an  unpredictable,  but  bounded,  function.  Thus,  the  prediction  error  becomes 

e(t,0)  =  H~\q,  6)[y{t)  -  P(q,0)u(t)]  (35) 

The  only  resriction  on  the  model  set  is  that  the  predictor ,  i.e.,  the  map  (y,  u)  t-t  e,  defined 
implicitly  above,  is  stable.  Thus,  we  require  that  the  operators  H^^q.B)  and  H~l(q,6)P{q,6 ) 
are  stable,  which  defines  the  elements  of  the  set  V.  The  reason  for  this  restriction  is  that  despite 
unknown  initial  conditions,  limsup,^oo£(t,0)  =  e(<),  which  is  obviously  the  best  the  predictor 
can  do  considering  the  unpredictability  of  e(<). 


4.1.1  Least  Squares  and  Linear  Regression 

The  model  set  we  will  use  here  is  the  equation  error  model  set 

^(9>  %(*)  =  B(<h  0)u(<)  +  e(0  (36) 

where  A{q,  6)  and  B(q,  0)  are  polynomials  in  q~J  whose  coefficients  are  the  elements  of  0,  that  is, 
let  _ 

A(g,0)  =  1  +  fli?-1  4-  •  •  ■  +  fljn9_2n 
B(q,0)  =  biq-1  +  ---+b3nq'2n 
0T  =  (aj  •  •  •  aj„  b\  •  •  •  fc2„] 


(37) 


This  formulation  coincides  with  (27).  Hence, 


and  the  predictor  becomes 


P(qJ) 

H(qJ) 


B(q,6) 

1 

A(q,6) 


e(t,9)  =  A(q,9)y{t)-B(q,9)u(t) 


(38) 


(39) 


which  is  obviously  stable,  because  A(q,  6)  and  B(q,  6)  are  stable  for  any  6  €  R4n,  thus,  V  =  R4n. 
The  prediction  error  can  also  be  written  as  the  linear  regression 


e(t,9)  =  y(t)  -  9T<f>(t) 
where  <p(t)  £  R4n  is  the  regressor  given  by 

<f>T(t)  -  [-y(<  -  1) - y(t  -  2n)  u(t  -  1)  •  •  •  u(t  -  2n)] 


(40) 


(41) 


In  this  case,  (33)  has  the  well  known  closed  form  solution 

k  =  R:'ba 


(42) 


where 

a.  -  ii;ii.(4)*(i)][i.(5)*(0)T 

JV  (=1 

(43) 

k  =  iE[£.(Sww)[£.(rt»(‘)] 

«=i 


provided  that  Ral  exists.  A  sufficient  condition  is  that  La(q)4>(t)  is  persistently  exciting  and  the 
length  of  the  data  record  N  is  sufficiently  large. 

Definition 

A  sequence  f(t)  £  Rp  is  said  io  be  persistently  exciting  if  there  is  a  positive  constant  f3  and  a 
positive  integer  M  such  that  for  all  r  £  (0,oo) 

!1  r+M  1 

(44) 

Clearly  if  La{q)4>(i)  is  persistently  exciting  and  N  >  M,  then  R~ 1  exists. 


4.1.2  Estimating  the  Nominal  Transfer  Function 


Having  found  9a,  form  the  parametric  plant  transfer  function  estimate,  denoted  by  Pa(q),  using 
the  equation  error  model,  i.e., 

P*(q)  = 

Thus,  we  obtain  the  family  of  parametric  models 


B(qJa) 

^(«>  k) 


(45) 


Observe  that  the  true  plant  Po(q)  belongs  to  every  member  of  the  family  of  sets  of  uncertainty 

{P(Pa,\Aa\),aeA}  (47) 

where  the  true  model  error  is 

Aa(?)  =  Pa(q)  -  Po(q)  (48) 

Using  the  data  record  (y(t),u(t)  :  t  =  we  now  seek  to  find  the  family  of  sets  of 

uncertainty 

{P(P0tS0),a€A}  (49) 

with  the  property  that 

Sa(u)  >  [-v,*},VaeA  (50) 

which  is  sufiicient  to  guaranty  that  for  all  a  £  A  we  have  Pq  £  P(Pa,6a). 

4.1.3  Bias  in  Least  Squares  Parameter  Estimate 


The  parameter  error  0a  —  or  bias,  is  a  result  of  unmodeled  dynamics  and  disturbances.  To 
compute  the  bias,  observe  that  (14)  can  be  expressed  as 


rw  B(q,e.)  * 

Po(9)  A(q,6.)+P^ 

(51) 

Hence,  it  follows  that  (15)  can 

be  written  as 

J(i)  =  +  £(!,«.) 

(52) 

where  the  prediction  error  is 

£(t,e.)  =  A(5,*.)|P(9)u(i)  +  d(i)] 

(53) 

Substituting  this  expression  into  (42)  gives  the  bias  as 

(54) 

where 

(55) 

By  carefully  choosing  the  data  filter  L(q)  and  the  input  spectrum  Suu(w),  it  is  clear  that  the 
bias  can  be  made  small.  Hence,  it  is  possible  to  design  the  identification  experiment  so  that 
P»(q)  ~  P(g,  6.)/A(q,  6.),  and  the  inherent  conservatism  in  the  set  of  uncertainty  P(P0,Sa)  due 

to  parameter  bias  is  insignificant.  Issues  of  experiment  design  will  not  be  pursued  here.  The 
interested  reader  is  referred  to  Ljung(1987)  and  the  references  therein. 

4.2  Non-Parametric  (Spectral)  Estimation 

The  approach  we  propose  for  estimating  the  model  error  is  to  use  standard  methods  of  non- 
parametric  transfer  function  estimation  based  on  spectral  estimation,  e.g.,  Jenkins  and  Watts(196S), 
Ljung(  1985,1987).  We  will  assume  that  u(t)  and  d(t)  are  zero-mean  sequences  with  spectral  den¬ 
sities  Su„(u>)  and  Sdd(u>),  and  cross-  spectral  density  Ss«(w).  These  are  all  defined  analogously 
with  (17).  Assume  that  the  input  and  disturbance  are  uncorrelated,  i.e., 


Sdu(v)  =  0 


(56) 


This  situation  arises,  for  example,  when  the  LSS  is  operating  without  a  stabilizing  feedback,  that 
is,  in  open-loop.  Recall  that  the  LSS  is  open-  loop  stable.  (  The  proceedure  described  below  can 
be  modified  to  account  for  a  stabilizing  feedback.  A  brief  discussion  appears  later  in  the  sequel.) 

Using  the  parametric  transfer  function  estimate  Pa(q),  form  th_'  corresponding  output  error, 

i.e., 

rto(t)  =  y(t)  -  Pa(q)u(t) 

(57) 

=  A  a(q)u(t)  +  d(t) 

where 

Aa(9)  =  Po(q)  ~  P*(q)  (58) 

is  the  true  model  error,  for  which  we  now  seek  an  estimate  based  on  the  data  records 

{^(t),u(<):<  =  1,/..,N)  (59) 

for  each  a  €  A. 

Using  the  same  notation  as  above,  the  cross-spectral  density  between  the  output  error  and 
the  input  is  given  by 

=  Aa(eJ")Suu(u>)  (60) 

The  standard  non-parametric  frequency  domain  estimate  for 
A0(eJU')  is  then 

A„(U>)  =  (61) 

Su  u(w) 

where  ^7o«(  u>)  and  Suu(u)  are  spectral  density  estimates  obtained  from  the  finite  data  record  (59). 
There  are  many  specific  ways  to  generate  spectral  density  estimates.  Following  Ljung(19S5,1987) 
or  Jenkins  and  Watts(1968)  we  use  the  smoothed  spectral  estimators: 

t=-N+1 


SuuH  =  E  ^(r)«w(r)e-^ 

r=-N+l 

where  the  correlation  estimates  are 


Xvo«(T)  =  Tr  E  +  M) 

Jy  t=i 


A.u(t)  =  E  «(*)«(< +  M) 

t=i 

The  function  A-,(r)  is  referred  to  as  the  lag  window  of  length  7,  where  typically  A-,(r)  =  0  whenever 
|r|  >  7.  The  above  spectral  estimates  can  be  equivalently  expressed  as  the  following  frequency 
domain  convolutions: 


Suu(u)  =  f  Wy(u  -  a)I*u(a)do 


(64) 


where  W^(u)  is  the  spectral  window ,  defined  as  the  Fourier  transform  of  the  lag  window,  i.e., 


W»=  £  A,(r)e->- 

t=-N+1 

and  where  /£u(u>)  and  7^(w)  are  the  periodograms  given  by 


(65) 


CM  =  ^(f><*KJ-) 

(66) 

CM  = 


By  combining  the  above  expressions,  the  frequency  domain  estimate  of  model  error  can  be  com¬ 
pactly  expressed  as 

A  (u)  =  (67) 


Observe  that  as  7  increases  the  spectral  window  becomes  more  narrow.  In  addition,  the  spectral 
window  is  usually  characterized  by  the  following  properties: 


W^{u)<Lj  =  1  /*,  wW,(w jdw  =  0 

/-%  =  W(7)  /_%  W*(w)«k  =  £A'(7) 


(68) 


where  as  7  increases,  M(-j)  decreases  and  A'(7)  increases.  Under  these  conditions,  it  can  be 
shown  [see,  e.g.,  Jenkins  and  Watts(1968)  or  Ljung(19S5,l987)j,  that  for  large  N,  large  7,  and 
small  K(i)/N  : 

£.<«)  -  mop  *  u’hw.wy? + (6s) 

where 

RM  =  (7°) 

with  '  and  "  denoting  differentiation  with  respect  to  w,  once  and  twice,  respectively.  Hence,  for 
some  large  N,  large  7,  and  small  A'(7)/jV,  the  error  |Aa(u>)  —  A0(f,w)|  can  be  made  arbitrarily 
small.  This  fact  justifies  the  proposed  uncertainty  modeling  scheme  for  on-line  robust  control 
design.  Specifically,  let  T(P,  C)  denote  the  closed-loop  system  corresponding  to  the  configuration 
of  Figure  ??,  let  T.  denote  the  set  of  all  acceptable  closed-loop  transfer  matrices,  let  Po  denote 
the  true  plant,  and  for  each  a  €  A,  let  P(P0,60)  denote  the  set  of  uncertainty  defined  by 

P {Pa, 60)  =  {P(q)  6  S  :  |P(e*)  -  A(e*)|  <  «.(«)!>  (71) 

We  can  now  state: 

Theorem  2 

Suppose  that  for  all  a  £  A  there  is  a  function 

6a(u)  >  |A0(u;)|,  Vu>  €  (-*,*■]  (72) 

If  there  is  an  a.  £  A  and  a  corresponding  compensator  Ca.(q),  such  that 

T(P,  Ca.)  £  T.,  VP  €  P(Pa.,Sa.) 


(73) 


(74) 


then,  for  some  large  N,  large  7,  and  small  K{*t)/N , 

T(P0,Co.)eT. 


Theorem  2  provides  conditions  under  which  the  true  plant  is  a  member  of  the  estimated  set 
of  uncertainty.  Hence,  any  controller  which  robustly  stabilizes  the  set  of  uncertainty  will  also 
stabilize  the  true  plant.  In  this  context  robust  stabilization  means  that  the  closed-loop  system 
T(P,  C„.)  €  T.  for  all  P  in  the  set  of  uncertainty. 

The  difficulty  in  applying  the  theorem  is  to  properly  select  N,*f,  and  most  importantly,  the 
function  ^(w).  A  natural  choice  for  the  latter  is 

^(u>)  =  [l-fJfc(u,)]|Aa(u»)|  (75) 


where  k(u)  >  0.  Hence,  6a(u)  >  |A0(eJ“)|,Va  €  A  if 

-m  <  TTwy  v“ 6  ^ 


where 


1/2 


For  large  7,  typical  spectral  window  characteristics  behave  like 

Hence,  for  large  7, 


M( 7)»~,  *(7)*A'o7 

7 


1  (K  m  ,uv, 2  .  js  7  Sdd(u)\1/2 
|A.(e*")|  [  74  lRa(  }l  °NSUV(“)) 


(76) 

(77) 

(78) 

(79) 


Observe  that  if  Suu(u)  is  more  or  less  constant  for  u>  <  fln/Slta  and  attenuates  rapidly  thereafter, 
which  is  usually  the  case,  then  JZ0(u>)  «  (l/2)A”(u>).  Thus,  for  some  large  7,  6a(u)  >  |A0(e*")|, 
provided  that 


Mo 

272 


K(e}u) 


A0(e>“') 


AM  „,d  , 


l  +  *(  u)  N  Suu(u>) 


(SO) 


4.2.1  Summary  of  Uncertainty  Estimation  Procedure 

The  following  steps  summarize  the  above  procedure  for  estimating  model  uncertainty: 

Step  1  Given  a  preselected  filter  parameter  o  €  -4,  form  the  filtered  prediction  error,  and 
then  solve  for  9a  from  (33). 

Step  2  Using  the  parametric  transfer  function  estimate  P(q,  6a),  form  the  output  error  data 
record  (59),  and  then  calculate  the  frequency  domain  model  uncertainty  estimate  A0(u>)  from 
(67),  using  standard  spectral  estimation  procedures. 

Step  3  Design  a  robust  control  based  on  the  plant  being  a  member  of  the  estimated  set  of 
uncertainty  P(P0,  ifc|A0|),  as  defined  in  (71)  for  some  function  k(u>)  >  0,Vu>  £  [— ir.ir]. 

Step  4  If,  for  some  a.  €  A,  there  is  a  feedback  compensator  C0.($),  such  that  the  closed-loop 
systems  in  the  set. {T(P,  Ca.),  VP  £  P(Pa.,^.)}  satisfy  performance  objectives,  then  implement 
the  compensator.  Otherwise,  go  to  Step  5. 

Step  5  Select  a  new  value  for  the  filter  parameters  a  £  A  and  go  to  Step  1.  If  all  filter  param¬ 
eter  values  in  A  have  been  exhausted,  then  implement  that  compensator  in  the  set  {Ca,  a  £  A) 
which  produces  the  best  closed-loop  performance,  i.e.,  the  nearest  to  the  performance  objectives. 


4.3  Closed-Loop  Uncertainty  Estimation 


Suppose  that  the  LSS  system 


y(i)  =  Po(q)u(t)  +  d(t) 


is  operating  in  closed-loop  with  the  stabilizing  feedback 


(81) 


u(t)  =  r(t)  -  C0(q)y(t) 


(82) 


The  least-squares  parametric  procedure  for  obtaining  P„  will  still  work,  but  the  spectral  approach 
for  obtaining  the  model  error  estimate  Aa  needs  to  be  modified.  The  reasoning  is  as  follows:  let 
r(t)  =  0  and  suppose  that  the  spectral  estimates  Sr)av(  u>)  and  S’ttu(w)  are  very  close  to  the  true 
spectra  SnaU(u))  and  Sul4(w),  respectively.  We  then  have 


Aa(u)  -  Aa(eJ") 


■SnuM 

Suu(u) 


a 


1 

C0(c>“) 


+  Po(eiw) 


(83) 

(84) 


which  can  be  an  arbitrarily  bad  estimate.  To  correct  this  difficulty,  suppose  that  r(t),  which 
is  available  to  the  user,  is  selected  so  that  it  is  uncorrelated  with  the  disturbance  d(t)  and  its 
spectrum  is  much  larger  than  the  disturbance  spectrum,  i.e.,  Srd(u)  =  0  and  5rr(w)  S^w).  If 
the  spectral  estimates  are  close  to  their  true  values  then 


Aa(u)-A0(en 


(85) 

(86) 


Thus,  by  careful  external  input  selection  (experiment  design),  it  is  possible  to  use  the  uncertainty 
estimation  procedure  also  in  closed  loop  with  a  nominal  stabilizing  controller. 


5  Example  of  Uncertainty  Estimation 


In  this  section  the  uncertainty  estimation  procedure  is  applied  to  data  obtained  from  the  laser 
pointing  experiment  described  in  Walker  et  al.(1984).  A  schematic  drawing  of  the  apparatus  is 
shown  in  Figure  4. 


Figure  4:  Laser  pointing  and  control  experiment 


The  objective  of  the  experiment  is  to  control  the  jitter  of  the  laser  beam.  The  single  actuator 
consists  of  a  pivoted  proof-mass  which  exerts  a  reaction  force  on  the  flexible  beam  whenever 


the  proof-mass  is  moved  by  an  applied  control  current  through  the  armature.  As  the  flexible 
structure  vibrates,  the  laser  beam  changes  its  angular  direction.  A  quadrature  detector,  mounted 
on  the  structural  support,  registers  the  laser  beam  position,  as  long  as  it  is  in  the  field-of-view 
of  the  detector.  The  laser  beam  strikes  a  mirror  on  the  flexible  structure  and  is  reflected  by 
another  mirror  mounted  on  the  proof-  mass  actuator.  The  resulting  beam  is  split  in  two  by  a 
beam-splitter,  with  one  ray  going  to  the  quadrature  detector  and  the  other  to  a  screen  where  the 
jitter  is  magnified  for  visual  inspection.  The  mass  of  the  actuator  is  greator  than  the  mass  of  the 
flexible  structure,  thereby  insuring  a  significant  interaction  between  the  actuator  and  the  flexible 
structure. 

The  problem  is  to  control  the  jitter  from  4  Hz  to  20  Hz.  The  input  u(f )  is  chosen  as  a  sine- 
sweep  lasting  about  16  seconds,  sweeping  from  4  Hz  to  20  Hz  and  sampled  at  51.2  Hz.  The 
number  of  data  samples  is  N  =  1024. 


Figure  5:  (a)  DFT(u),  (b)  DFT(y),  (c)  ETFE  and  P0(e>“') 


Figure  5  shows  the  discrete- Fourier  -transform  (DFT)  of  the  input  and  output,  as  well  the  ratio 
of  output  DFT  to  input  DFT,  referred  to  as  the  emperical  transfer  function  estimate  (ETFE), 
compared  to  the  “true”  transfer  function  Po(e}U).  Since  the  data  is  taken  from  a  physical  device, 
the  true  system  dynamics  are  not  really  known.  What  we  refer  to  as  the  “true”  system  is  an  8th 
order  equation  error  model  obtained  from  the  data  by  using  the  least  squares  estimate  (42)  with 
n  =  m  =  8.  This  gives 

Bo(q,6) 

Ml,  6) 

-.0044g_1  +  .0016g“J  -  .056g'3  -  ,ei5g“4 
+  .0l3g-5  -  .Ollg"6  -  .033g-7  -  .027q‘8 
1  -  l.llg"1  +  .54g-2  +  .47g-3  -|-  .77g~* 

-  .17g“5  +  .08g~6  +  .40g-7  +  .0057g“# 


Po{q)  = 

B(q,9)  = 
Ao(g,0)  = 


The  parametric  model  is  chosen  as  a  4th-order  equation  error  model  as  in  (36),  where 


4(g, 0)  =  1  +  Qi9-1  +  •  •  •  +  a*q~4 
B(q,0)  =  bxq~'  +  •  •  •  +  b4q~* 
eT  =  [a1,...,a4,61,...,64] 

The  auxiliary  filter  parameter  set  is  the  4-tuple, 

.4  =  {0,1, 2, 3} 

such  that  the  corresponding  filters  have  the  following  properties: 

Lo(q)  =  1,  the  “natural”  filter.  . 

L\(q)  =  8th-order  Butterworth  with  passband  [.-Htt,  .677t). 

£2 (q)  —  8th-order  Butterworth  with  passband  [.llir,  .40jt]. 

£3(5)  =  16th-order  Butterworth  with  passband  [.15tt,  .40jt). 

The  effect  of  each  filter  is  shown,  respectively,  in  Figures  6  to  9.  Each  figure  shows  3  plots: 
the  two  top  ones  showing  gain  and  phase  of  the  estimated  plant  (the  dashed  lines)  compared  to 
the  true  plant  (the  solid  lines),  and  the  bottem  plot  comparing  magnitudes  of  the  model  error 
estimate  (the  dashed  line)  from  (67)  with  the  true  model  error  (the  solid  line). 

The  top  and  bottem  plots  also  show  the  magnitude  of  the  passband  filter  used  to  form  the  fil¬ 
tered  prediction  error.  Observe  that  the  model  error  estimates  are  quite  accurate  in  the  frequency 
range  [.15,  .70],  which  is  the  same  range  where  the  input  has  a  nice  flat  DFT.  The  model  error 
estimates  were  obtained  using  a  rectangular  window,  and  no  attempt  was  made  here  to  adjust 
the  lag  window  width  to  achieve  a  better  resolution.  The  best  model  estimate  is  clearly  for  the 
filter  £3(9),  particularly  over  the  passband  of  interest. 

These  results  give  confidence  to  the  uncertainty  model  estimation  procedure  proposed  here, 
because  despite  the  fact  the  some  of  the  estimated  transfer  functions  are  quite  poor,  the  model 
uncertainty  estimate  is  very  good.  Hence,  a  robust  control  designed  on  the  basis  of  the  estimated 
set  of  uncertainty  would  not  de-stabilize  the  true  system.  The  next  step  is  to  continue  with  the 
above  example,  for  example,  and  ir.  corporate  a  specific  robust  design  control  design  method. 


6  Concluding  Remarks 

A  method  for  estimating  the  set  of  uncertainty  of  a  plant  transfer  function  has  been  proposed 
and  analyzed.  Theorem  2  provides  analytic  justification  that  the  true  plant  can,  under  suitable 
conditions,  be  guaranteed  to  be  in  the  estimated  set  of  uncertainty.  An  example  is  presented 
which  supports  the  theory.  One  of  the  restrictions  in  this  paper  is  that  the  plant  is  stable  and  no 
feedback  is  present  during  the  experiment.  The  results,  however,  can  be  extended  to  the  case  of 
an  unstable  plant  operating  in  closed-loop  with  a  stabilizing  feedback  compensator. 

Although  in  this  paper  we  restrict  the  auxiliary  parameters  to  essentially  index  data  filters, 
the  concept  can  be  broadened.  For  example,  the  filters  can  be  chosen  differently  for  the  input  and 
output  data.  In  addition,  the  auxiliary  parameters  can  index  various  inputs  which  have  different 
spectral  content  and  the  choice  made  by  using  a  local  optimization  on  the  auxiliary  parameters. 
The  index  parameters  are  thus  choices  which  can  be  made  to  tailor  the  identification  experiment 
in  accordance  with  the  use  of  the  model.  Off-  line  procedures  for  experiment  design  are  discussed, 


Figure  6:  (a)  gain,  (b)  phase,  (c)  model  error  magnitude 
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Figure  7:  (a)  gain,  (b)  phase,  (c)  model  error  magnitude 


for  example,  in  Ljung(1987)  or  Goodwin  and  Payne(1977).  The  issue  addressed  here,  which  might 
be  called  “on-line  experiment  design”,  remains  to  be  studied  and  further  developed. 

The  results  of  this  paper  raise  some  questions,  like:  why  not  use  non-parametric  spectral 
estimation  to  obtain  the  transfer  function  estimate  in  the  first  place?  The  answer  is  partly  to  be 
found  in  Ljung(  1985,1987),  where  it  is  shown,  as  might  be  expected,  that  the  parametric  tech¬ 
niques  have  a  smaller  variance  (asymptotically)  as  a  function  of  data  length.  Here,  we  are  using 
the  parametric  model  for  the  critical  frequency  range  where  control  is  needed,  whereas  a  cruder 
spectral  analysis  is  used  for  the  frequency  range  oyer  which  control  is  not  as  important.  Another 
question:  why  not  just  increase  the  model  order,  and  hence,  stay  with  parametric  methods?  This 
approach  is  taken  in  Wahlberg(19S6),  where  after  identification  a  model  reduction  is  performed. 
The  problem  is  that  we  now  have  one  plant  estimate  and  no  set  of  uncertainty,  i.e.,  the  esti¬ 
mate  is  assumed  to  be  sufficiently  accurate.  Hopefully  it  is,  but  this  can  only  be  established  by 
an  a  priori  analysis,  i.e.,  there  is  no  on-line  estimate  of  the  set  of  uncertainty.  It  may  be  that 
such  an  estimate  is  unnecessary  for  a  particular  problem,  provided  that  the  experiment  has  been 
appropriately  designed. 
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1  Introduction 


Our  concern  is  with  the  adaptive  control  of  plants  with  both  parametric  (structured)  uncer¬ 
tainty  and  dynamical  (unstructured)  uncertainty.  The  plant  is  linear,  and  subject  to  distur¬ 
bance  and  measurement  noise.  Knowing  the  value  of  the  uncertain  parameters  allows  for  the 
design  of  a  robust  controller  to  cope  with  the  unstructured  uncertainty.  Moreover,  there  is  no 
robust  controller,  i.e.,  one  that  is  linear-time-invariant,  that  will  handle  the  entire  spread  of 
structured  and  unstructured  uncertainty.  Hence,  there  is  the  need  for  an  adaptive  control. 

More  precisely,  let  V  denote  the  family  oi  plants  given  by 

V  =  {P{z)  =  P(z,a)[  1  +  A(r,a)]  :  o  €  A,  |A((e^),a)|  <  |*((e*-),a)|,Vu,  €  [-*,*]}  (1) 

where  o  is  the  structured  parameter  constrained  to  a  subset  A  of  Rp,  and  A(z,o)  is  the 
unstructured  uncertainty  bounded  by  the  weighting  function  6(z,  a).  Assume  that  there  is  a 
finite  number  of  parameter  “design”  values  Oj, . . . ,  a^,  each  in  A,  such  that  for  any  a  €  A,  not 
necessarily  equal  to  aj, . . .  ,(*n,  there  exists  a  controller  Ci(z),  tuned  to  a,  for  :ome  i,  which 
acheives  satisfactory  performance  in  the  face  of  the  unstructured  uncertainty,  and  in  the  face 
of  the  difference  between  the  actual  value  of  a  and  the  design  value  a,.  For  example,  the  t'-th 
controller  C,(z)  in  the  set  Ci(z), . . . ,  Cjv(z),  provides  robust  performance  for  any  plant  in  the 
family 

Vi  =  {P(z)  €V:\\a-  a<||  <  m,}  (2) 

where  Vi, . . .  ,Vn  completely  cover  the  plant  set  V ,  that  is 

vc\j  Vi  (3) 

«=i 

Our  task  in  this  paper  is  to  explain  how,  when  a  is  unknown,  the  controller  Ct(z)  can  be 
selected  to  control  the  plaint.  Jhis  type  of  adaptive  controller  is  not  like  the  conventional  ones 
where  either  model  or  controller  parameters  are  directly  adjusted.  Here  the  parameters  are 
contained  in  the  mechanism  for  switching  amongst  the  preselected  robust  controllers,  which  is 
essentially  a  gain  scheduling  proceedure,  but  is  adaptive  in  the  sense  that  the  schedule  is  being 
learned  from  the  measured  data.  The  gain  schedule  is  usually  set  in  advance,  for  example, 
in  a  flight  control  system  the  gain  schedule  is  a  predetermined  function  of  the  Mach  number 
and  aerodynamic  pressure.  One  of  the  interesting  possible  adavantages  of  this  method  is  that 
although  the  plant  may  have  a  large  number  of  uncertain  parameters  (a  in  the  above  notation), 
it  is  possible  that  only  a  few  controllers  are  required,  and  hence  only  a  few  parameters  in  the 
selection  mechanism.  Also  the  individual  robust  controllers  can  be  based  on  uncertainty  in 
physical  parameters  rather  than  canonical  parameters,  such  as  transfer  function  coefficients, 
as  used  in  the  conventional  parameter  estimation  techniques,  e.g.,  least  squares  with  a  linear 
regression  model. 

The  idea  of  using  preselected  robust  controllers  as  described  above  is  due  to  K.  Poola  [1] 
who  also  proposed  a  particular  smooth  adaptive  selection  algorithm.  Here,  we  examine  the 
adaptive  selection  mechanism  as  discussed  in  [2]  and  [3],  which  will  be  described  in  detail  in 
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the  sequel.  In  [3]  this  scheme  is  referred  to  as  a  Multiple  Model  Adaptive  Control  (MMAC), 
but  the  individual  controllers  are  not  necessarily  selected  to  be  robust  in  the  manner  described 
above.  Here  we  also  provide  an  analysis  of  the  convergence  properties  of  the  adaptive  selection 
algorithm  following  the  analysis  in  [2], 


2  Structure  of  the  Adaptive  Controller 

Figure  1  depicts  the  general  adaptive  set-up  that  we  consider.  The  design  of  each  controller 
Ci(z )  is  not  an  adaptive  control  design,  but  rather,  a  robust  control  design  task  involving 
the  plant  family  V  and  the  coverings  V,.  The  signals  used  to  adaptively  select  the  controller 
are  the  innovations  sequences,  or  prediction  errors,  denoted  by  £i(t), . . .  ,£n(<)>  where  each  is 
obtained  from  the  Kalman  filters  denoted  KFi,.. .  ,KFn. 

Although  there  are  many  structures  for  the  individual  controllers  C,(z),  one  structure 
which  utilizes  the  prediction  errors  is  shown  in  Figure  2.  In  this  case  C,(z)  consists  of  an 
observer  based  state  feedback  controller  together  with  an  auxiliary  signal  obtained  by  pro¬ 
cessing  the  observer  innovations  sequence  £,(f)  through  a  stable  transfer  function  Q,{z).  An 
important  aspect  regarding  the  flexibility  of  this  controller  structure  is  that  each  C,(z)  is 
a  parametrization,  in  terms  of  stable  Qi(z),  of  all  stabilizing  controllers  of  the  plant  model 
P(r,  a,),  see,  e.g.,[4].  That  is,  all  controllers  which  stabilize  P(z,at)  are  obtained  by  letting 
Q%(z),  referred  to  as  the  Youla  parameter ,  range  over  all  stable  transfer  functions. 

The  adaptive  selection  'mechanism  can  be  abrupt  or  smooth,  and  we  will  in  the  full  paper 
examine  in  detail  both  of  these  choices.  First,  we  review  the  results  in  [2]  amd  then  provide 
the  extension  to  the  feedback  case.  The  adaptation  mechanism  is  the  real  object  of  interest  in 
this  paper.  We  set  it  up  using  the  ideas  of  [2]  and  [3].  In  broad  outline,  the  idea  is  this:  Design 
predictors  (e.g.,  Kalman  filters)  which  are  optimal  for  each  of  the  plants  P(z,  a ;),  t  =  1, . . . ,  N. 
(Their  design  also  requires  values  to  be  assigned  to  noise  covariances.)  We  then  process  the 
innovations,  or  prediction  errors,  from  each  predictor  and  compute  certain  quantities,  roughly, 
the  scalar  sample  covariance.  The  index  value  im  €  [1, ...  ,1V]  whose  predictor  produces  the 
smallest  sample  covariance  is  selected  and  the  corresponding  Ci,(z )  controller  is  used  to  control 
the  plant. 


3  Review  of  [2] 

Two  issues  are  examined  in  [2].  First,  when  the  plant  is  in  the  model  set,  and  secondly,  when 
it  is  not. 
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3.1  Plant  in  Model  Set 


Let  a  plant  be  drawn  from  the  collection  { P(z ,  a*) :  t  =  1, . . . ,  N),  say  the  plant  is  P(z,  ato). 
Suppose  the  plant  has  disturbances  and  measurement  noise,  but  no  exogenous  input.  Let  N 
Kalman  filters,  each  tuned  to  {P(z,  a*)  :  i  =  1,. . . , ./V},  be  connected  to  the  plant.  Let  the 
asymptotic  design  innovation  covariances  be  and  let  e,(t)  denote  the  sequence  obtained  from 
the  t-th  filter  connected  to  P(z,  alo),  at  the  point  in  the  filter  where  the  innovation  sequence 
would  normally  be  observed.  Recall  that  the  quantities  £,(<)  and  Q,  are  available  from  the 
Kalman  filter  and  covariance  equations,  respectively.  Moreover,  the  design  covariance  f 1,  is 
computible  in  adavance  of  the  measurements.  Note  also  that  £to(t)  is  actually  an  innovation 
sequence,  but  £,(<)  for  i  ^  i0  is  not  in  general.  Define,  for  i  ^  i0, 


L,(t) 


P(a,e|2') 


(4) 


where  z *  denotes  the  sequence  of  measurements  y(l), . . .  ,y(<),  and  p(a|z‘)  axe  the  a  posteriori 
probabilities  which,  assuming  gaussian  distributions,  are  sequentially  computible  from 


P(Qil^*~1)\/^et(0,~1) exp{— (t)fl,  ^.(t)} 

LjLi  P(*jl2t~1)]/det(n-1)exp{-%eJ(t)Cl-1£J(t)} 


It  follows  immediately  that 


lnLi(t)  =  In  Li(t  -  1)  +  |[ej(<)ftr.1«<.(0  -  eJWi 1  £.(<)] 
Thus,  as  t  — ►  oo,  and  for  all  i  ^  i0, 


where  for  all  i, 
with 


|ln|L,(l)]--(V'-Vj.) 

V,  =  lndet(fi,)  +  tr 


(6) 

(7) 

(8) 


E, 


1  k=\ 


(9) 

(10) 


The  expectation  operator  £(•)  is  of  course  taken  with  respect  to  the  noise  processes.  By 
definition  of  optimality, 

<  V,  Vi  ?  i0  (11) 


and  thus,  Vt  ^  i0,  as  t  — ►  oo, 


Pfol**)  .  n 

P(*d*f) 


(12) 
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exponentially  fast  with  asymptotic  convergence  rate  exp{— (V,  —  Vi„)t}.  This  means  that  as 
t  — »  oo, 


and  hence,  the  correct  plant  model  is  captured.  Intuitively  this  is  as  expected,  because  the 
plant  P(z,  qIo)  is  in  the  model  set  { P(z ,  o,) :  *  =  1, . . . ,  N),  and  moreover,  one  of  the  Kalman 
filters  is  optimal. 


3.2  Plant  Not  in  Model  Set 

Next,  in  [2],  the  casfe  is  cbnsicfered  when  the  plant  (and  noise  covariances)  is  not  in  the  model 
set.  Specifically,  let  the  true  plant  be  given  by 

P0(z)  =  P(2,a0)[l  +  A(z,a0)]  (14) 

for  some  a0  £  A,  where  now  none  of  the  Kalman  filters  is  optimal.  The  question  is:  if  one 
acts  as  if  the  true  plant  were  in  the  model  set,  and  the  probabilities  are  computed  as  before, 
then  what  actually  happens? 

The  answer  is  that  the  quantities  p^jjz*)  computed  from  (5),  which  might  now  be  more 
correctly  referred  to  as  pseudo-probabilities,  behave  exactly  as  before,  namely,  that  as  t  — ►  oo, 


where 


;  i: 


t.  =  arg  min  K 

«€[1 . N] 


Hence,  the  algorithm  selects  the  plant  model  P{z,  a,.)  in  the  model  set  {P(z,  a,)  :  t  = 
1, . . . ,  N}  which  is  closest  to  the  true  plant  P0{z )  in  the  sense  of  minimizing  V<  over  all  i.  As 
shown  in  [2],  this  is  equvalent  to  minimizing  the  Kullback  information  measure  computed  on 
an  asymptotic  per  sample  basis. 

We  remark  also  that  if  <$,(w)  and  $0(u;)  are  the  model  and  plant  output  spectrum,  respec¬ 
tively,  then 

*i(w)  =  Wiie^Wfie-n  (17) 

*o(w)  =  W0{t’“)Sl0Wl{e~n  (18) 

where  W~l(z)  and  W~1(z)  are  the  transfer  functions  from  the  measurement  sequence  y(t)  to 
the  model  innovations  £,(<)  and  true  innovations  e0(t),  respectively.  Note  that  these  transfer 
functions  can  always  be  selected  to  be  stable  and  stably  invertible.  Hence,  we  can  compute 

2.  by 

£,•  =  ^  f_W-\en$0{u>)[W-\t~n]Tdw  (19) 
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and  also 


trfQr'E.-)  =  ^  J\t[$0(u)*-'(u:)}<Lj 


(20) 


Finally,  we  get 

Vi  =  lndet(ft,)  +  J  tr[$0(u;)$”1(a;)]  du>  (21) 

l  •  1  ** 

4  The  Effect  of  Feedback  and  Exogenous  Inputs 

We  work  now  to  consider  the  arrangement  depicted  in  Figure  3.  Let  the  plant  system  be 
described  by 

y((t)  =  P0(z)u(t)  +  W0(z)v0(t)  (22) 

where  v0(t )  is  a  zero  mean  white  noise  sequence  with  covariance  matrix  fi0.  The  feedback 
system  is 

u(t)  =  C(z)[r(t )  -  y(t)]  (23) 

where  r(t)  is  a  stationary  zero  mean  sequence  with  spectrum  4>r(u;).  The  controller  C(z)  may 
be  one  of  the  avaiable  set  of  controllers  C\(z), . . . ,  Cjv(z).  Recall  that  each  controller  CAz) 
is  designed  to  stabilize  any  plant  in  the  set  Vj  defined  in  (2).  Observe  that  if  the  true  plant 
P„(z )  is  not  in  this  set,  then  the  closed  loop  system  may  be  unstable  if  Cj(z)  is  applied. 

The  block  labeled  KFi  denotes  a  Kalman  filter  which  is  designed  for  the  plant  model 

y(t)  =  P(z,  c*i)u(t)  +  W(z,  aii)vi(t)  (24) 

where  Vi(i)  is  a  zero  mean  white  noise  sequence  with  covariance  matrix  flt.  Hence,  the  inno¬ 
vations  sequence  e,(t)  from  KF,  is  given  by 

£i{t)  =  W~\z,  a i)[y(t)  -  P(z ,  a.)u(t)]  (25) 

The  difference  here  in  relation  to  [2]  is:  (i)  the  inclusion  of  an  external  input  u(f)  to 
the  plant  and  the  Kalman  filter,  and  (ii)  the  generation  of  that  input  by  a  combination  of 
an  exogenous  input  r(t)  and  a  feedback  compensator  C(z).  We  now  examine  how  the  filter 
selection  algorithm  (5)  behaves  in  two  cases,  namely,  when  C(z)  stabilizes  P0(z)  and  when  it 
does  not. 


4.1  Stabilizing  Feedback 

Suppose  that  the  feedback  compensator  C(z)  stabilizes  P0(z).  In  this  case  the  pseudo¬ 
probabilities  p(ai|z*)  are  again  computed  from  (5),  but  now  we  take  the  set  of  measurements 
as  z{  =  {y(k),u(k)  :  k  =  The  result  (15),  (16)  is  the  same  as  before:  the  algorithm 

selects  the  plant  model  that  is  closest  in  the  sense  of  the  information  measure,  i.e.,  that  plant 
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*  * 

model  P(z,a,),  W(z,cti)  which  is  closest  to  the  true  plant  P0(z),  W0(z)  in  the  sense  of  mini¬ 
mizing  V{  in  (21).  What  is  different  is  the  formula  for  Vi.  To  compute  this  first  observe  that 
(we  drop  the  explicitly  shown  dependence  on  z,  t ,  and  a,  to  simplify  notation) 

£,  =  W~'[(P0  ~  Pt)u  +  W0u0]  (26) 

and  under  the  feedback  action 

u  =  C(I  +  P0C)-1[r-W0u0]  (27) 

Note  that  by  saying  that  C(z)  stabilizes  P0(z)  we  mean  that  the  transfer  functions  C(I  + 
P0C)~l ,  (/  +  P0C)-1,  and  P0(J  +  P0C)~l  are  all  stable.  Hence,  we  get 

£i  =  HiWov0  4-  Gir  (28) 

where 

H,  =  W~\I  +  PiC)(I  +  PoC)-1  (29) 

and 

Gi  =  W~\P0  -  Pi)C(I  +  PoC)'1  (30) 

Assuming  that  r(t)  and  vQ{t)  jure  uncorrelated,  then 

$e.(o;)  =  Hi{tn*0{u)Hj{e-j")  +  Gi(en*r(u>)Gj(e-n  (31) 

and  hence 

S,  =  ^r  ^  (32) 

To  get  better  insight  into  this  formula,  observe  what  happens  when  we  model  the  plant  input- 
output  dynamics  correctly,  that  is,  when  P,  =  P„.  In  this  case  the  presence  of  the  external 
input  has  no  effect  and  we  get 

E‘  =  S  L  (33) 

as  before.  In  the  more  general  case  with  a  stabilizing  feedback,  the  above  formula  for  E,  is 
effected  by  errors  in  both  the  dynamical  model  due  to  P0(z)  —  P(z,  a,)  and  in  the  noise  model 
via  W0(z)  —  W(z,a,).  Signal  to  noise  ratio  is  also  relevant,  for  example,  if  $r(u>)  >>  $c(u>), 
then  the  term  involving  G,(z)  tends  to  dominate,  i.e.,  the  noise  model  error  is  not  as  important 
as  the  dynamical  model  error. 

i  .  • 

Other  insights  can  also  be  obtained  by  using  the  expressions  derived  in  [5],  such  as 


(34) 

where 

Ti  =  [P0  -  Pi  Wo-Wil 

(35) 

Further  interpretations  will  be  provided  in  the  full  paper. 
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4.2  Unstabilizing  Feedback 


It  could  be  that  the  adaptive  selection  algorithm  (5)  picks  out  a  control  which  would  stabilize 
one  of  the  plant  models,  but  de-stabilizes  the  true  plant.  The  question  arises  as  to  what 
happens  if  the  same  algorithm  is  used  to  drive  the  adaptive  switch.  Clearly,  the  limiting 
process  in  (21)  will  no  longer  be  valid. 

It  is  shown  in  the  full  paper  that  if  the  dominnent  unstable  mode  of  the  closed  loop  system 
is  at  z  =  C»  ICI  >  1>  then  the  selection  algorithm  identifies  that  plant  P(z,  a,)  for  which 
P(£,a,)  is  “most  like”  P0( £)•  If  P((,a,)  =  P0(Q,  then  the  controller  C,(z)  appropriate  for 
P(z,  a,),  when  connected  to  P0(z),  will  not  produce  an  unstable  mode  at  (.  Nor  will  it  do  so 
ifP(d<*.)  fts  P0(C)  i  because  the  controller  is  robustly  designed  to  stabilize  all  plants  in  the 
set  Vi  defined  in  (2).  However,  there  is  no  a  priori  guaranty  that  it  will  give  no  other  unstable 
mode  C+  7^  C-  So  despite  the  fact  that  there  is  a  systematic  way  of  switching  out  an  unstable 
controller,  there  is  no  guaranty  that  instability  can  be  instantaneously  repaired.  One  could 
randomly  select  any  alternative  control  from  the  set  Ci(z), . . .  ,Cn(z),  or  better,  whenever 
instability  is  deemed  to  occur,  say  whenever  ||y(t)||  exceeds  some  threshhold,  a  stabilizing, 
but  low  authority  controller,  can  be  switched  in  until  the  adaptive  system  selects  a  new  plant/ 
controller. 

5  Algorithm  Modifications 

The  pseudo-probability  algorithm  (5)  can  be  modified  in  a  number  of  ways.  The  influence  of 
past  data,  which  tends  to  make  the  algorithm  sluggish,  can  be  reduced  by  discarding  old  data. 
For  example,  suppose  that  the  controller  selection  is  to  be  based  on  the  smallest 

'  Vi(t)  =  lndet(fti)  +  l  £  V-*5e?(*)ftrle<(fc)  (36) 

1  k= i  z 

where  A  €  (0, 1]  is  the  usual  “forgetting  factor”  which  exponentially  discards  old  data.  Let  p,-(t) 
denote  the  pseudo-probability  associated  with  V'i(t)  above.  The  algorithm  (5)  now  becomes 


\pi{t  -  l^Vdet^r^exp {-§ef(t)fti 1  £,(<)} 

(37^ 

P'  £&,[*(<  " 

) 

Following  the  derivations  in  [2],  we  now  get 

In Li(t)  =  Ain L,(t  -  1)  +  £,.(<)  -  efWT'eM 

(38) 

where 

t.  =  argmin  Vi(t) 

(39) 

and  for  t  ^ 

(40) 
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(41) 


As  in  the  previous  cases,  it  can  be  shown  that  as  t  — ►  oo, 


1,  *  =  »• 
0,  t  ^  ». 


exponentially  fast. 

Other  variants  will  be  explored  in  the  full  paper. 


6  Adaptive  Selection 

In  the  full  paper  we  will  provide  a  detailed  analysis  when  the  the  control  is  given  by 

=  (42) 

i=i 

with 

«<(<)  =  Ci(z)[r(i )  -  y(<)],  *  =  1, . . . ,  N  (43) 

where  ft (t), . . .  ,ftv(t)  are  selection  coefficients,  or  weights,  which  are  determined  from  the 
pseudo-  probabilities  pi(t), . . .  ,P//(f).  We  will  examine  two  types  of  adaptive  selection  mecha¬ 
nisms,  namely  (i)  abrupt  selection,  and  (ii)  smooth  selection.  In  either  case  it  may  be  necessary 
to  also  provide  for  a  back-up  controller 

«o(<)  =  C0(z)[r(t)  -  y(t)]  (44) 

which  is  of  low-authority,,  but  stabilizes  all  plants  in  V.  This  control  can  be  switched  in 
whenever  an  “instability”  is  deemed  to  occur. 


6.1  Abrupt  Adaptation 


In  this  case  we  adjust  the  weights  ft(f)  abruptly.  For  example: 


1,  pi{i  -  1)  <  pth  <Pi(t) 

0,  otherwise 


(45) 


where  ptK  €  (0, 1)  is  some  threshhold  value,  probably  close  to  one.  Hence  ,  the  current  control 
is  held  until  one  of  the  pseudo-probabilities  is  sufficiently  large,  and  then  a  new  control  is 
switched  on  in  its  place. 


6.2  Smooth  Adaptation 

To  avoid  unpleasentries  associated  with  abrupt  switching  one  can  swithch  smoothly.  The  most 
obvious  choice  is  simply  to  set 

Pi(i)  =  Pi{*)  (46) 
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Adaptive  Control 
via 

Finite  Modeling  and  Robust  Control 

Robert  L.  Kosut’and  Brian  D.O.  Anderson* 


Abstract:  This  papers  examines  an  adaptive  con¬ 
trol  scheme  which  involves  the  selection  of  a  fixed 
controller  out  of  a  finite  collection,  where  each  con¬ 
troller  is  robustly  designed  to  account  for  the  true 
plant  being  in  a  not  necessarily  small  set  of  uncer¬ 
tainty.  There  are  no  adaptive  “parameters”  in  the 
conventional  sense  where  adjustments  are  made  to  ei¬ 
ther  model  or  controller  parameters.  The  parameters 
here  involve  pseudo-probabilities  or  weights  assigned 
to  each  of  the  finite  controllers.  These  quantities  are 
computed  recursively  from  the  measured  data. 

1  Introduction 

Our  concern  is  with  the  adaptive  control  of  plants 
with  both  parametric  (structured)  uncertainty  and 
dynamical  (unstructured)  uncertainty.  The  plant  is 
linear,  and  subject  to  disturbance  and  measurement 
noise.  Knowing  the  value  of  the  uncertain  param¬ 
eters  allows  for  the  design  of  a  robust  controller  to 
cope  with  the  unstructured  uncertainty.  Moreover, 
there  is  no  robust  controller,  i.e.,  one  that  is  linear- 
time-invariant,  that  will  handle  the  entire  spread 
of  structured  and  unstructured  uncertainty.  Hence, 
there  is  the  need  for  an  adaptive  control. 

More  precisely,  let  V  denote  the  family  of  plants 
given  by 

V  =  (/>(:)=P(:,«)[l  +  d(:,tt)):«€A 
|A(^,o)|<|tf(^.a)|,V-,€[-T,T]} 

(1) 

where  a  is  the  structured  parameter  constrained  to  a 
subset  A  of  Rp,  and  A(r,a)  is  the  unstructured  un¬ 
certainty  bounded  by  the  weighting  function  <5(r,a). 
Assume  that  there  is  a  finite  number  of  parameter 
“design”  values  or x,...,ajv,  each  in  A,  such  that 
for  any  a  €  A,  not  necessarily  equal  to  qj,  . . . ,  as, 
there  exists  a  controller  C;(i),  tuned  to  a,  for  some 
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»,  which  acheives  satisfactory  performance  in  the  face 
of  the  unstructured  uncertainty,  and  in  the  face  of  the 
difference  between  the  actual  value  of  a  and  the  de¬ 
sign  value  a,-.  For  example,  the  i-th  controller  Q(z) 
in  the  set  C\(z), .. .  ,  C/v(r),  provides  robust  perfor¬ 
mance  for  any  plant  in  the  family 

Vi^iPWeV-.Wcc-otiWKmi)  (2) 

where  V\ completely  cover  the  plant  set  V, 
that  is 

N 

Vc{]Vi  (3) 

i=i 

Our  task  in  this  paper  is  to  explain  how,  when  a 
is  unknown  or  slowly  varying,  the  controller  C,(c) 
can  be  selected  to  control  the  plant.  This  type  of 
adaptive  controller  is  not  like  the  conventional  ones 
where  either  model  or  controller  parameters  are  di¬ 
rectly  adjusted.  Here  th£  parameters  are  contained 
in  the  mechanism  for  switching  amongst  the  prese¬ 
lected  robust  controllers,  which  is  essentially  a  gain 
scheduling  proceedure,  but  is  adaptive  in  the  sense 
that  the  schedule  is  being  learned  from  the  measured 
data.  The  gain  schedule  is  usually  set  in  advance,  for 
example,  in  a  hight  control  system  the  gain  schedule 
is  a  predetermined  function  of  the  Mach  number  and 
aerodynamic  pressure.  One  of  the  interesting  possi¬ 
ble  advantages  of  this  method  is  that  although  the 
plant  may  have  a  large  number  of  uncertain  param¬ 
eters  (a  in  the  above  notation),  it  is  possible  that 
only  a  few  toritrollers  are  required,  and  hence  only  a 
few  parameters  in  the  selection  mechanism.  Also  the 
individual  robust  controllers  can  be  based  on  uncer¬ 
tainty  in  physical  parameters  rather  than  canonical 
parameters,  such  as  transfer  function  coefficients,  as 
used  in  the  conventional  parameter  estimation  tech¬ 
niques,  e.g.,  least  squares  with  a  linear  regression 
model. 

The  idea  of  using  preselected  robust  controllers 
as  described  above  is  due  to  K.  Poola  [1],  who  also 
proposed  a  particular  smooth  adaptive  selection  al¬ 
gorithm.  Here,  we  examine  the  adaptive  selection 
mechanism  as  discussed  in  [2]  and  [3],  which  will  be 
described  in  detail  in  the  sequel.  In  [3]  this  scheme 
is  referred  to  as  a  Multiple  Model  Adaptive  Con¬ 
trol  (MMAC),  but  the  individual  controllers  are  not 
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necessarily  selected  to  be  robust  in  the  manner  de¬ 
scribed  above.  Here  we  also  provide  an  analysis  of 
the  convergence  properties  of  the  adaptive  selection 
algorithm  following  the  analysis  in  [2]. 


2  Structure  of  the  Adaptive 
Controller 


Figure  1  depicts  the  general  adaptive  set-up  that  we 
consider.  The  design  of  each  controller  C,(z)  is  not 
an  adaptive  control  design,  but  rather,  a  robust  con¬ 
trol  design  task  involving  the  plant  family  V  and 
the  coverings  Vi-  The  signals  used  to  adaptively  se¬ 
lect  the  controller  are  the  innovations  sequences,  or 
prediction  errors,  denoted  by  £i(i), . . .,£,\r(t),  where 
each  is  obtained  from  the  Kalman  filters  denoted 
I<Fu...,KFn. 

Although  there  are  many  structures  for  the  in¬ 
dividual  controllers  Ci{z),  one  structure  which  uti¬ 
lizes  the  prediction  errors  is  shown  in  Figure  2.  In 
this  case  C,(r)  consists  of  an  observer  based  state 
feedback  controller  together  with  an  auxiliary  sig¬ 
nal  obtained  by  processing  the  observer  innovations 
sequence  Ei (f)  through  a  stable  transfer  function 
Qi(z).  An  important  aspect  regarding  the  flexibil¬ 
ity  of  this  controller  structure  is  that  each  Ci(z)  is 
a  parametrization,  in  terms  of  stable  Qi(z),  of  all 
stabilizing  controllers  of  the  plant  model  P(z,Qi), 
see,  e.g.,[4].  That  is,  all  controllers  which  stabilize 
P(z,ai)  are  obtained  by  letting  Q,(c),  referred  to  as 
the  Youia  parameter,  range  over  all  stable  transfer 
functions. 

The  results  in  [2]  are  first  reviewd.  We  then  pro¬ 
vide  the  extension  to  the  feedback  case.  The  adapta¬ 
tion  mechanism  is  the  real  object  of  interest  in  this 
paper.  We  set  it  up  using  the  ideas  of  [2]  and  [3]. 
In  broad  outline,  the  idea  is  this:  Design  predictors 
(e.g.,  Kalman  filters)  which  are  optimal  for  each  of 
the  plants  />(z,o,),  i  =  I N .  (Their  design  also 
requires  values  to  be  assigned  to  noise  covariances.) 
We  then  process  the  innovations,  or  prediction  errors, 
from  each  predictor  and  compute  certain  quantities, 
roughly,  the  scalar  sample  covariance.  The  index 
value  i,  €  [1,...,/V]  whose  predictor  produces  the 
smallest  sample  covariance  is  selected  and  the  cor¬ 
responding  Cj.(r)  controller  is  used  to  control  the 
plant. 


3  Review  of  [2] 


3.1  Plant  in  Model  Set 


Let  a  plant  be  drawn  from  the  collection  {P(r,a,)  : 

*  =  1 . N},  say  the  plant  is  P(z, a*.).  Suppose 

the  plant  has  disturbances  and  measurement  noise, 
but  no  exogenous  input.  Let  N  Kalman  filters,  each 
tuned  to  {P(r,a-j)  :  i  =  1, be  connected 
to  the  plant.  Let  the  asymptotic  design  innovation 
covariances  be  fii  and  let  e,(t)  denote  the  sequence 
obtained  from  the  i-th  filter  connected  to  P(z, a,o), 
at  the  point  in  the  filter  where  the  innovation  se¬ 
quence  would  normally  be  observed.  Recall  that  the 
quantities  Ei(t)  and  fij  are  available  from  the  Kalman 
filter  and  covariance  equations,  respectively.  More¬ 
over,  the  design  covariance  Q,-  is  computible  in  ada- 
vance  of  the  measurements.  Note  also  that  Ei0(t)  is 
actually  an  innovation  sequence,  but  £,-(<)  for  i  ^  i0 
is  not  in  general.  Define,  for  i  ^  t'„, 


Li{t) 


Pfo.'lz*) 

P(«U*‘) 


(4) 


where  z *  denotes  the  sequence  of  measurements 
y(l), . . . ,  y{t),  and  p(a|z‘)  are  the  a  posteriori  prob¬ 
abilities  which,  assuming  gaussian  distributions,  are 
sequentially  computible  from 


E.(t)  =  exp{-ief(t)firle.(0}  (5) 


It  follows  immediately  that 


lnL,(t)  =  In  Li(t  —  I) 


t  det^.J 

2  det(fii) 

(6) 

Thus,  as  t  -*  oo,  and  for  all  i  i0, 

?ln[L,(t)]--(K-KJ 

(7) 

where  for  all  i, 

Vi  =lndet(n,)  +  tr(n,-1E<) 

(8) 

with 

(9) 

1  *=1 

=  £{£,(i)ff(0} 

(10) 

The  expectation  operator  £(•)  is  of  course  taken  with 
respect  to  the  noise  processes.  By  definition  of  opti¬ 
mality, 

Vi.  <  K-,  Vi  #  i0 

and  thus,  Vi  i„,  as  t  — <•  oo, 

pM**)  n 

P(ai.k‘) 


(11) 

(12) 


Two  issues  are  examined  in  [2].  First,  when  the  plant 
is  in  the  model  set,  and  secondly,  when  it  is  not. 
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exponentially  fast  with  asymptotic  convergence  rate 
exp{-(V<  -  This  means  that  as  t  — *  oo, 

(13) 

and  hence,  the  correct  plant  model  is  captured.  Intu¬ 
itively  this  is  as  expected,  because  the  plant  P(z,  a,-,) 
is  in  the  model  set  (P(c,a,)  :  i  =  1  and 

moreover,  one  of  the  Kalman  filters  is  optimal. 

3.2  Plant  Not  in  Model  Set 

Next,  in  [2],  the  case  is  considered  when  the  plant 
(and  noise  covariances)  is  not  in  the  model  set. 
Specifically,  let  the  true  plant  be  given  by 

P°(z)  =  <*«)[!  +  A(c,or#)]  (14) 

for  some  a„  €  A,  where  now  none  of  the  Kalman 
filters  is  optimal.  The  question  is:  if  one  acts  as  if 
the  true  plant  were  in  the  model  set,  and  the  prob¬ 
abilities  are  computed  as  before,  then  what  actually 
happens? 

The  answer  is  that  the  quantities  p(a,|i‘)  com¬ 
puted  from  (5),  which  might  now  be  more  correctly 
referred  to  as  pseudo-probabilities,  behave  exactly  as 
before,  namely,  that  as  l  — »  oo, 

-  { J:  ; ; i  <i5> 

where 

».  =  arg  min  V,  (16) 

Hence,  the  algorithm  selects  the  plant  model 

P(z,Qi.)  in  the  model  set  {P(j,a,)  :  i  =  l,...,Af} 
which  is  closest  to  the  true  plant  P0(z)  in  the  sense 
of  minimizing  VJ  over  all  i.  As  shown  in  [2],  this 
is  equvalent  to  minimizing  the  Kullback  information 
measure  computed  on  an  asymptotic  per  sample  ba¬ 
sis. 

We  remark  also  that  if  $,  (w)  ami  !>„(-;)  are  the 
model  and  plant  output  spectrum,  respectively,  then 

*<(«)  =  Wi(e^)fi,H'1r(e-^)  (17) 

=  W0(e^)njVj(e-’")  (18) 

where  Wfl(z)  and  \V~  !(c)  are  the  transfer  functions 
from  the  measurement  sequence  y(t)  to  the  model 
innovations  £,(<)  and  true  innovations  £■„(/),  respec¬ 
tively.  Note  that  these  transfer  functions  can  always 
be  selected  to  be  stable  and  stably  invertible.  Hence, 
we  can  compute  £*  by 

Ei  =  h  S\  wr'^w)^)[\vr\t-n\T  du 

(19) 


and  also 

tr ■(nr1E<)=s  tr^oM^fV)]^  (20) 

Finally,  we  get 

Vi  =  In  det(Qj)  +  ~  J  tr[$a(w)*f  l(w)J  du  (21) 

4  The  Effect  of  Feedback  and 
Exogenous  Inputs 

We  work  now  to  consider  the  arrangement  depicted 
in  Figure  3.  Let  the  plant  system  be  described  by 

2/(0  =  P0(z)u(t)  -f  Wa(z)»0(t)  (22) 

where  u„(t)  is  a  zero  mean  white  noise  sequence  with 
covariance  matrix  fi„.  The  feedback  system  is 

u(t)  =  C(z)[r(0  -  y(0]  (23) 

where  r(t)  is  a  stationary  zero  mean  sequence  with 
spectrum  $r(w).  The  controller  C(z )  may  be  one 
of  the  available  set  of  controllers  Ci(z), . . . ,  Cn[z). 
Recall  that  each  controller  Cj(z)  is  designed  to  sta¬ 
bilize  any  plant  in  the  set  V,  defined  in  (2).  Observe 
that  if  the  true  plant  P0(z)  is  not  in  this  set,  then 
the  closed  loop  system  may  be  unstable  if  Cj(z)  is 
applied. 

The  block  labeled  KFi  denotes  a  Kalman  filter 
which  is  designed  for  the  plant  model 

2/(0  =  P(z,c*i)v(t)  +  W(z,o,>f(0  (24) 

where  u^t)  is  a  zero  medii  white  noise  sequence  with 
covariance  matrix  0,-.  Hence,  the  innovations  se¬ 
quence  £j(0  from  KFi  is  given  by 

* (0  =  W'~lt*.«,)ty(0  -  p(*.  «.>(<))  (25) 

The  difference  here,  in  relation  to  [2]  is:  (i)  the  in¬ 
clusion  of  an  external  inpht  u(f)  to  the  plant  and  the 
Kalman  filter,  and  (ii)  the  generation  of  that  input 
by  a  combination  of  an  exogenous  input  r(t)  and  a 
feedback  compensator  C(z).  We  now  examine  how 
the  filter  selection  algorithm  (5)  behaves  in  two  cases, 
namely,  when  C(z)  stabilizes  P0{z)  and  when  it  does 
not. 

4.1  Stabilizing  Feedback 

Suppose  that  the  feedback  compensator  C(r)  sta¬ 
bilizes  P0(-)-  In  this  case  the  pseudo-probabilities 
p(ai|c‘)  are  again  computed  from  (5),  but  now  we 
take  the  set  of  measurements  as  z‘  =  (y(ifc),  u(k)  : 
k  =  The  result  (15),  (16)  is  the  same  as 
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before:  the  algorithm  selects  the  plant  model  that  is 
closest  in  the  sense  of  the  information  measure,  i.e., 
that  plant  model  P(z,cii),  W(z,cti)  which  is  closest 
to  the  true  plant  P0(z),  W0(z)  in  the  sense  of  mini¬ 
mizing  Vi  in  (21).  What  is  different  is  the  formula 
for  Vi .  To  compute  this  first  observe  that  (we  drop 
the  explicitly  shown  dependence  on  z,t,  and  Oj  to 
simplify  notation) 


o\ 

II 

s 

1 

:p 

'c" 

+ 

o 

(26) 

and  under  the  feedback  action 

u  =  C(I  +  PoC)~l[r-  U>0] 

(27) 

Note  that  by  saying  that  C(z)  stabilizes  P0{z)  we 
mean  that  the  transfer  functions  C{I  +  P0C)~l ,  (7  + 
P„C)-1,  and  P0(I  +  P0C)~l  are  all  stable.  Hence, 
we  get 

e<  =  HtW0»o  hG,v  (28) 

where 

Hi  =  Wr\l  +  PiC)(I  +  PoC)-1 

(29) 

and 

Gi  =  Wr\P0  -  P,)C(7  +  P0C)- 1 

(30) 

Assuming  that  r(t)  and  v0(t)  are  uncorrelated,  then 

<M“)  = 

+Gi(e^)<t>r(v)Gf(e-n  (31) 


and  hence 

=  27  (32) 

To  get  better  insight  into  this  formula,  observe  what 
happens  when  we  model  the  plant  input-output  dy¬ 
namics  correctly,  that  is,  when  P,  =  P0 .  In  this  case 
the  presence  of  the  external  input  has  no  effect  and 
we  get 

W;-1(^)^0(w)(H'(-,(e-^))Tdu; 

J~*  .  (33) 

as  before.  In  the  more  general  cax*  » :’h  a  stabiliz¬ 
ing  feedback,  the  above  formula  for  L‘,  u  .-ffected  by 
errors  in  both  the  dynamical  ino-J-l  !  to  P0{z)  - 
P(z,Qi)  and  in  the  noise  model  via  :  t  -  U'(r,  a,). 
Signal  to  noise  ratio  is  also  rel-v  f  r  -xample, 
if  <tr(w)  >>  $„(«),  l^en  the  t.-rm  ....  G,(z) 

tends  to  dominate,  i.e.,  the  noi«*  i..  ‘  I  r  is  not 
as  important  as  the  dynamical  m  :  ■  rr  r 

Other  insights  can  also  be  obt.ui.-  1  l  j  u-mg  the 
expressions  derived  in  [5],  such  as 

ei  =  r0  +  W-1fi(“a)  (34) 

where 

fi  =  [Po  -Pi  W0-  W,}  (35) 


4.2  Unstabilizing  Feedback 

It  could  be  that  the  adaptive  selection  algorithm  (5) 
picks  out  a  control  which  would  stabilize  one  of  the 
plant  models,  but  de-stabilizes  the  true  plant.  The 
question  arises  as  to  what  happens  if  the  same  algo¬ 
rithm  is  used  to  drive  the  adaptive  switch.  Clearly, 
the  limiting  process  in  (21)  will  no  longer  be  valid. 

Roughly  what  happens  is  this:  if  the  dominant 
unstable  mode  of  the  closed  loop  system  is  at  z  = 
C,  |C|  >  1.  then  the  selection  algorithm  identifies 
that  plant  P(z,ai)  for  which  P(C,a,)  is  “most  like” 
Po( C).  If  P(C,Q,i)  =  Po(0.  then  the  controller  C,(r) 
appropriate  for  P(z, Oi),  when  connected  to  P0(c), 
will  not  produce  an  unstable  mode  at  £.  Nor  will 
it  do  so  if  P(£,a,)  ss  P»(C),  because  the  controller 
is  robustly  designed  to  stabilize  all  plants  in  the  set 
Pi  defined  in  (2).  However,  there  is  no  a  priori  guar¬ 
anty  that  it  will  give  no  other  unstable  mode  C+  / 

So  despite  the  fact  that  there  is  a  systematic  way  of 
switching  out  an  unstable  controller,  there  is  no  guar¬ 
anty  that  instability  can  be  instantaneously  repaired. 
One  could  randomly  select  any  alternative  control 
from  the  set  Ci(z), ..  .,Cjv(z),  or  better,  whenever 
instability  is  deemed  to  occur,  say  whenever  ||t/(t)|| 
exceeds  some  threshhold,  a  stabilizing,  but  low  au¬ 
thority  controller,  can  be  switched  in  until  the  adap¬ 
tive  system  selects  a  new  plant/  controller. 


5  Algorithm  Modifications 


The  pseudo-probability  algorithm  (5)  can  be  modi¬ 
fied  in  a  number  of  ways.  The  influence  of  past  data, 
which  tends  to  make  the  algorithm  sluggish,  can  be 
reduced  by  discarding  old  data.  For  example,  sup¬ 
pose  that  the  controller  selection  is  to  be  based  on 
the  smallest 


Vi(t)  =  lndet(ni)  +  f£y-k±eT(k)(l7'ei(l-) 

1  k  =  1 

(36) 

where  A  £  (0, 1)  is  the  usual  “forgetting  factor”  which 
exponentially  discards  old  data.  Let  p<(t)  denote  the 
pseudo-probability  associated  with  K(<)  above.  The 
algorithm  (5)  now  becomes 


Define, 


i.  =  arg  min  Vj(t) 

I 


(38) 


and  for  »  ^  i., 


Li(t) 


p.(  0 
Pi.(0 


(39) 
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As  in  the  previous  cases,  it  can  be  shown  that  as 
t  — <•  oo, 

«<•>-{  wt  <40> 

exponentially  fast. 


6  Concluding  Remarks 

We  have  revisited  some  earlier  work  on  multiple 
model  adaptive  control  and  have  introduced  some  of 
the  ideas  of  robust  control  into  the  procedure.  The 
controller  parametrization  in  Figure  2  leads  one  to 
hope  that  a  more  robust  multiple  model  controller 
has  the  form 


N 

«(0  =  {Q.(’k.(0}  (4i) 

i=i 

Compare  this  to  the  ’’natural”  choice  given  by 
N 

«(<)  =  5><0  toWKO  -  5/(0]}  (42) 

t=i 

Apparently  a  periodic  controller  switching  mecha¬ 
nism  can  lead  to  the  above  ’’robust”  form,  e.g.,  [1], 
but  this  is  not  verified  for  the  simpler  switching 
mechanism  considered  here.  In  any  event,  it  may 
be  necessary  to  also  provide  for  a  back-up  controller 

«o(0  =  Co(*)[r(t)-*(0)  (43) 

which  is  of  low-authority,  but  stabilizes  all  plants  in 
V.  This  control  can  be  switched  in  whenever  an  “in¬ 
stability”  is  deemed  to  occur. 
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Figure  3:  Fixed  Feedback  and  Multiple  Filters 


On  The  Use  of  The  Method  of  Averaging 
For  The  Stability  Analysis  of 
Adaptive  Linear  Control  Systems 


Robert 


Abstract  -  The  uses  and  limitations  of  the  method  of  av¬ 
eraging  are  discussed  with  application  to  linear  parameter 
adaptive  systems.  The  method  of  averaging  has  shown  to 
provide  a  means  to  explicitly  determine  some  of  the  im¬ 
portant  aspects  of  adaptive  system  performance,  but  there 
are  inherent  limitations,  e.g.,  slow  adaptation  and  param¬ 
eters  restricted  to  trajectories  within  the  constant  param¬ 
eter  stability  set.  Remedies  to  alleviate  these  restrictions 
are  discussed,  based  primarily  on  the  use  of  a  fixed  point 
analysis. 

1.  INTRODUCTION 

Although  uncertainty  underlies  the  reason  for  using  an 
adaptive  control  system,  too  much  of  the  wrong  kind  of  un¬ 
certainty  may  be  cause  for  grief.  For  example,  simulations 
of  some  simple  adaptive  control  systems,  under  apparently 
minor  non-ideal  conditions,  have  shown  degraded  perfor¬ 
mance  and  even  instabilities,  e.g.,  Rohrs  et  a!.(I98I, 1982). 
In  this  paper  we  examine  the  uses  and  limitations  of  the 
method  of  averaging  for  the  stability  analysis  of  adaptive 
linear  control  systems. 

A  mathematical  treatment  of  averaging  can  be  found  in 
Chapter  5  of  Hale  (1969)  or  in  Chapter  4  of  Guckenheimer 
and  Holmes  (1983).  Applications  to  adaptive  systems  can 
be  found,  to  name  a  few  sources,  in:  Ljung  and  Soderstrom 
(1983),  where  stochastic  averaging  methods  are  employed 
to  study  the  asymptotic  parameter  trajectories  of  recursive 
parameter  estimation  algorithms  in  system  identification 
methods;  Astrom  (1983,  1984)  showing  how  the  method 
of  averaging  explains  instabilities  and  drift;  Reidle  and 
Kokotovic  (1985)  study  averaging  for  the  linearized  adap¬ 
tive  system  and  establish  a  sharp  stability-instability  fre¬ 
quency  domain  test;  Kosut,  Anderson,  and  Mareels  (1987) 
on  the  relation  between  averaging  and  persistent  excita¬ 
tion;  Bodson  et  al.(1986)  on  nonlinear  averaging  analysis 
and  determining  the  rate  of  convergence;  Riedle  and  Koko¬ 
tovic  (1986)  on  the  slow  integral  manifold  analysis;  and 
the  monograph  by  Anderson  et  al.(1986),  which  uses  the 
method  of  averaging  and  a  total  stability  approach. 

*Dr  Kosut  is  a  Senior  Research  Scientist  at  Integrated  Systems, 
Inc.,  2500  Mission  College  Blvd.,  Santa  Clara,  CA  95054,  and  is 
a  Consulting  Professor  in  the  Dept,  of  Electrical  Engineering  at 
Stanford  University 

*  Research  supported  by  AFOSR,  Directorate  of  Aerospace  Sci¬ 
ences,  under  Contract  F49620-85-C-0094. 
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Uses  and  Limitations 

The  method  of  averaging  provides  a  great  deal  of  insight 
into  the  behavior  of  adaptive  systems.  The  analysis  pro¬ 
vides  quantitative  tests  for  determining  local  stability  and 
instability,  rate  of  convergence,  and  robustness  to  unmod¬ 
eled  effects.  But,  it  is  not  a  panacea,  as  there  are  inherent 
restrictions  in  the  analysis.  In  the  first  place,  the  speed  of 
adaptation  required  to  satisfy  the  theoretical  conditions  is 
most  often  far  below  that  as  determined  from  simulations. 
Secondly,  the  results  are  valid  only  when  the  parameters 
are  restricted  to  a  subset  of  the  constant  parameter  stabil¬ 
ity  set.  Projection  techniques  can  be  employed  to  restrict 
the  parameters  as  required,  but  in  many  practical  cases 
this  is  not  feasible  nor  even  necessary,  e.g.,  in  output  error 
system  identification  and  adaptive  control,  momentary  un¬ 
stable  parameter  settings  can  induce  very  rapid  learning, 
e.g.,  Anderson  (1985).  However,  the  exact  mechanism  is 
not  well  understood.  Also,  slow  adaptation,  if  it  is  used, 
can  be  counter-productive  in  some  instances  because  per¬ 
formance  can  be  below  par  for  the  long  period  of  time  it 
takes  for  the  parameters  to  adjust. 

Transient  Analysis 

To  remove  these  restrictions  requires  understanding  the 
transient  behavior  of  adaptive  systems.  Some  preliminary 
results  are  reported  in  Kosut  et  al.(1987)  e".!  Kosut  and 
Bitmead  (1986).  The  tools  for  analysis  involve  a  combi¬ 
nation  of  small  gain  theory,  passivity,  and  the  method  of 
averaging,  with  these  all  linked  together  by  the  Contrac¬ 
tion  Mapping  Principal.  Some  of  these  ideas  will  be  briefly 
described  in  this  paper. 

Beyond  Hand  Calculations 

Although  each  of  these  tools,  in  principal,  involves 
straightforward  calculations,  even  simple  examples  can 
just  barely  be  worked  out  by  hand.  It  is  clear  that  the 
level  of  complexity  of  a  realistic  adaptive  system  is  well 
beyond  hand  calculation.  Hence,  in  order  for  any  of  the 
above  mentioned  analytic  methods  to  be  of  practical  bene¬ 
fit,  it  is  imperitve  to  develop  “user-friendly”  software  tools 
which  provide  the  requisite  nonlinear  analysis. 

At  the  present  time,  aside  from  simulation  capability, 
there  are  no  available  software  tools  for  dealing  principally 
with  adaptive  systems.  This  is  a  research  issue  in  both 
mathematics  and  computation,  and  it  is  one  that  is  essen- 
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tions.  Thus,  the  parameter  adaptive  system  of  Figure  1  is 
an  adaptive  linear  control  system,  which  can  be  described 
by  the  coupled  set  of  ordinary  differential  equations 


Figure  1:  Parameter  adaptive  control  system 


tial  to  a  continuing  study  of  adaptive  systems.  This  point 
will  not  be  pursued  here,  but  it  is  sometimes  an  overlooked 
research  issue,  and  thus  warrents  more  than  a  passing  re¬ 
mark. 

II.  LINEAR  ADAPTIVE  SYSTEMS 
Adaptive  System  Structure 

Figure  1  depicts  a  parameter  adaptive  control  system 
where  u(t)  and  y(t)  are  the  vectors  of  measured  inputs 
and  outputs,  w(i)  is  a  vector  of  exogenous  inputs,  i.e.,  ref¬ 
erences,  disturbances,  and  noise  sources,  and  the  vector 
of  adaptive  parameters  is  0(t)  with  initial  value  60.  The 
adaptive  parameter  vector  is  mapped  into  a  “control”  pa¬ 
rameter  vector  p(t)  by  some  design  rule,  typically  an  im¬ 
plicitly  defined  memoryless  nonlinear  function  .  For  exam¬ 
ple,  in  control  system  synthesis,  one  can  use  any  number 
of  model  (parameter)  based  methods,  see  e.g.,  Safonov  et 
al.  (1981)  on  LQG  based  designs  or  Vidyasafcar  (1985)  on 
the  stable  factorization  approach.  In  system  identification, 
on  the  other  hand,  we  normally  have  a  the  simple  design 
rule  8  =  p.  A  more  extensive  display  of  how  the  vari¬ 
ous  standard  adaptive  identification  and  control  systems 
fit  into  the  structure  of  Figure  1  is  provided  in  Anderson 
et  al.(1986). 

In  order  to  illustrate  the  basic  ideas,  we  will  assume 
here  that  the  parameter  adaptive  algorithm  is  contiuously 
adjusted.  In  practice,  parameter  adjustments  would  be 
either  at  discrete  times  or  the  control  signal  is  the  output 
of  a  digital  computer. 

We  will  also  assume  that  for  fixed  values  of  the  con¬ 
trol  parameter  vector  p,  the  plant  and  controller  are  both 
linear-time-invariant  with  rational  proper  transfer  func- 


x  =  A(6)x  +  B(8)w(t) 

9  =  7  g(t,x,9)  (1) 

where.0(t)  €  Rp  is  the  adaptive  parameter,  and  x(t)  €  R" 
is  the  system  state,  consisting  of  plant,  controller,  filters, 
and  possibly,  reference  model  states.  The  matrix  functions 
A(8),  B(8)  are  determined  by  the  design  rule  8  i-»  p  and  the 
parametric  controller  structure.  The  nonlinear  function 
q(i,x,9)  and  the  adaptation  gain,  7,  a  positive  constant, 
are  determined  by  the  choice  of  algorithm. 

Constructing  the  Parameter  Adaptive  Algorithm 

We  will  restrict  our  discussion  here  to  the  gradient  al¬ 
gorithm 

8  =  7  4>c  (2) 

where  d>(t)  6  Rp  is  referred  to  as  the  regessor,  and  c(t)  € 
R  as  the  adaptation  error.  In  an  indirect  adaptive  control, 


a  typical  error  is 

the  equation  error 

t  —  y  —  8T  4> 

(3) 

with  control 

u  =  ~pTxp 

(4) 

and  design  rule 

II 

* 

(5) 

where  the  elements  of  d>(t)  e  Rn*  consist  of  filtered  ver¬ 
sions  of  (y,  u),  and  the  elements  of  ip(t)  e  R"'  consist 
of  filtered  versions  of  (y,  u,r),  where  r(t)  is  the  reference 
command.  In  a  direct  adaptive  control  system  we  can  take 
the  error  as 

e  =  y  -  Vm  (6) 

with  control 

1 i  =  -8T<t>  (7) 

where  ym(t)  is  the  reference  model  output. 

Goal  of  Adaptation 

In  general,  adaptive  algorithms  share  the  same  genesis 
and  purpose  no  matter  how  they  are  constructed.  Namely, 
to  adjust  9(t)  so  that  it  asymptotically  approaches  a  mem¬ 
ber  of  the  set 

Ttft  =  {8  £  Rn*  :  »vg{eJ}  is  a  minimum)  (8) 
where  avg{  )  is  defined  by 

avg{Y)  =  lim  ^  /  X{t)dt  (9) 

T-*«  1  J  0 

In  the  ideal  case,  the  parametrization  can  be  selected  so  as 
to  acheive  what  is  called  perfect  matching,  e.g.,  the  set  7^% 
has  a  single  member  such  that  e(t)  =  0  or  is  white  noise. 
Under  more  realistic  conditions,  the  best  that  can  happen 
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is  that  6(t)  asymptotically  approaches  a  small  neighbor¬ 
hood  of 

*.  =  {0  €  Rn*  :  avg{^c}  =  0,  avg{^T)  >  0}  (10) 

If  for  fixed  6,  the  regressor  d(t )  is  constructed  to  be  iden¬ 
tical  with,  or  proportional  to,  —dc{i)J 80 ,  then  T,  is  the 
set  of  all  local  minima  of  avg{eJ}.  Since  de(t)/d6  is  a 
function  of  the  true,  but  unknown  plant,  the  regessor  can 
at  best  be  constructed  as  an  approximation.  Constructing 
the  adaptive  algorithm  in  this  fashion  has  been  referred 
to  as  the  “MIT- Rule”,  see,  e.g.,  Whitaker  (1959)  and  Ma- 
reels  et  al.(1986). 

From  a  practical  point  of  view  it  is  acceptable  that  the 
parameters  approach  and  remain  in  a  small  neighborhood 
of  T.,  provided  that  members  of  this  set  also  produce  ac¬ 
ceptable  performance. 

The  Tuned  System 

Assuming  this  is  so,  let  6 .  €  Rn*  denote  such  a  setting, 
of  which  there  could  be  many.  We  refer  to  each  6.  as  a 
tuned  parameter  and  to  the  corresponding  system 

i,  =  4(«.)i  +  B(«.H<)  (11) 

as  the  tuned  system  [see,  e.g.,  Kosut  and  Friedlander 
(1985)].  Clearly,  the  tuned  system  is  the  same  as  sys¬ 
tem  (1)  but  with  0(t)  fixed  at  0..  We  can  now  pose  the 
following  questions  regarding  the  adaptive  system  (1): 

(1)  How  do  the  tuned  parameters  depend  on  the  exoge¬ 
nous  inputs? 

(2)  Is  the  adaptive  system  stable  in  a  neighborhood  of 
the  tuned  system,  i.e.,  are  solutions  (x ,  0)  stable  near 
(*.(t),0.)?  Furthermore,  how  small  is  this  neighbor¬ 
hood? 

(3)  What  is  the  region  of  attraction  in  ( x,0 )  to  a  small 
neighborhood  of  (x.(f),0.)? 

(4)  What  is  the  rate  of  convergence  to  this  small  neigh¬ 
borhood  of  (x.(t),9.)? 

Some  of  these  questions  can  be  answered  by  the  method 
of  averaging. 

III.  AVERAGING  ANALYSIS 

The  method  of  averaging  applies  to  a  differential  equa¬ 
tion  of  the  form 

±  =  7 /(<,*)  (l2) 

where  7  is  a  positive  constant.  If  7  is  sufficiently  small, 
then  intuitively,  the  stability  of  the  time-varying  system 
(12)  is  inherited  from  the  stability  of  the  simpler  au¬ 
tonomous  averaged  system 

x  =  7avg{/(-,*)}  (13) 

More  precisely,  following  Hale  (1969)  or  Guckenheimer  and 
Holmes  (1983),  we  have: 


Theorem  1  Suppose  that  f(t,x)  is  almost  periodic  in 
t  and  twice  differentiable  in  x,  for  all  x  in  a  bounded  set. 
Under  these  conditions: 

(ij  If  x(t)  and  xa(<)  are  solutions  of  (12)  and  (IS),  re¬ 
spectively,  then  ||x(0)  —  *.(0)||  =  0(~f)  implies  that 

«*(<)  -*.(011  =  0(7).  V*€[0. 0(1/7)]. 

(ii)  Suppose  there  exists  x°  satisfying 

avg{/(-,*#)}  =  0  (14) 

Then,  for  all  small  7  >  0,  there  exists  a  unique  solu¬ 
tion  x7(t)  of  (IS)  with  the  following  properties: 

(ii-a)  As  7  — ♦  0,  xy(t)  — ►  x# 

(ii-b)  xy(t)  is  uniformly  asymptotically  stable  (u.a.s.) 

if 


maxRe  Ai[B(z‘>)]  <  0 

• 

(15) 

rii-c)  xy(t)  is  unstable  if 

maxRe  Aj[B(x®)]  >  0 

I 

(16) 

where  x  1 -*  B(x)  is  the  matrix  function 

B(x)  =  ^avg{/(.,x)}  (17) 


In  order  to  apply  the  above  result  to  the  adaptive  sys¬ 
tem  (1)  it  is  first  neccesary  to  make  a  transformation  of 
variables,  so  that  the  resulting  system  has  the  appropriate 
form  as  expressed  by  (12).  The  transformation  is  referred 
to  as  a  time-scale  decomposition. 

Time-Scale  Decomposition 

From  the  previous  discussions  about  the  origins  of  the 
adaptive  algorithm,  we  are  clearly  interested  in  the  be¬ 
havior  of  the  adaptive  system  in  the  neighborhood  of  the 
tuned  system.  But  the  tuned  constant  parameter  setting 
0.  is  not  known  before  hand.  Hence,  following  the  procee- 
dures  given  in  Anderson  et  al.(1986),  we  study  the  behav¬ 
ior  of  (1)  in  the  neighborhood  of  all  constant  parameter 
solutions.  For  this  purpose,  let  x(t,6)  denote  the  state  x(t) 
when  7  =  0.  We  refer  to  x(t,0)  as  the  frozen  parameter 
system  state ,  or  frozen  state  for  short.  Hence,  for  each 
$  €  Rn#,  x(t,  0)  satisfies  the  partial  differential  equation 

dx/dt  =  A(0)x  +  B{6)w{t)  (18) 

By  introducing  the  error  state 

»»(<)  =  x(t)-z(t,0(t))  (19) 

the  (x,0)- system  of  (1)  can  be  transformed  into  the  equiv¬ 
alent  (q,  0)-system: 

6  =  7 /(*,  8, 7) 

q  =  A(0)q- 7$(t,0,q)  (20) 
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where  the  functions  /  and  g  are  given  by 


/(M.*?)  =  f(*(*.0)  +  l) 

sii.e.n)  =  [dx(t>0)/dW,0,v)  (21) 

The  transformation  of  (1)  into  (20)  is  referred  to  as  a 
timescale  decomposition,  because  for  small  7,  6(t)  changes 
much  more  slowly  than  q(l).  The  averaged  system  which 
describes  the  parameter  trajectories  for  small  7  is  then 

<i  =  7avg ;{/(-,  M)}  (22) 

Using  Theorem  1,  the  stability  analysis  of  the  adaptive 
system  can  be  divided  into  answering  questions  about  its 
asymptotic  and  transient  characteristics,  such  as: 

(1)  Asymptotic  analysis:  What  are  the  stability  proper¬ 
ties  of  (20)  in  the  neighborhood  of  ( 9 ,  tj)  =  (6. ,  0)? 

(2)  Transient  analysis :  What  is  the  rate  of  convergence 
and  region  of  attraction  of  (20)  to  a  small  neighbor¬ 
hood  of  (0.,O)? 

Asymptotic  Analysis 

The  definition  of  the  tuned  system  (x.,0.)  as  well  as 
the  stability  of  the  adaptive  system  in  the  neighborhood 
of  the  tuned  system  can  be  answered  by  Theorem  4.2  in 
Anderson  et  al.(1986)  or  Section  3  in  Bodson  et  al.(1985). 
The  flavor  of  these  results,  which  follows  from  Theorem  1, 
can  be  stated  as  follows: 

Theorem  2  Suppose  that  the  right  hand  sides  in  (1)  are 
almost  periodic  in  t  and  twice  differentiable  in  both  x  and 
0,  for  all  x  and  6  in  bounded  sets.  Under  these  conditions: 

(i)  If  0a(t)  is  a  solution  of  (22),  and  if  0a(O)  = 
0(O),x(O,0(O))  =  z(0),  then  Vt  €  [0, 0(1/7)],  ||0(<)  - 

*.(011  =  0(7)  *»d  ||*(t)  -  *(<A(<))||  =  0(7). 

(ii)  Let  the  tuned  parameter  set  be  defined  as  those  0,  E 
Rn*  which  satisfy 

*v«{/(»0*.o)}  =  ° 

Re  A[A(0.)]  <  0  (23) 

TAen,  for  all  small  7  >  0,  there  exists  a  unique  almost 
periodic  solution  07(t),z7(t)  of  (1)  with  the  following 
properties: 

(ii-a)  As  7  — ►  0,  91(t)  — ♦  0.  and  x7(f)  — *  *.(*) 

(ii-b)  07(<),x7(*)  is  uniformly  asymptotically  stable 
(u.a.s.)  if 

max  Re  Ai[R(0.)]  >  0  (24) 

(ii- c)  07(t),x7(t)  is  unstable  if 

max  Re  A ,  (R(0. )]  <  0  (25) 


where  6  *-►  R(0 )  is  the  matrix  function 

*(0)  =  -^avg{/(.,0,0)}  (26) 

!Jhe  sharp  stability-instability  boundary  expressed  by 
(24)  and  (25)  allows  not  only  for  an  assessment  of  a  partic¬ 
ular  design,  but  also  indicates  how  to  modify  and  improve 
the  algorithm.  This  is  easier  to  illustate  by  using  a  specific 
application.  Consider  the  direct  adaptive  control  system 
(2),(6)-(7).  The  matrix  R(6.)  required  in  Theorem  2  is 
then 


R(9.)  =  avg{*.(tff„*.)T  +  e.(/W.)T)  (27) 

where,  referring  to  Figure  1,  H,v  and  H^v  are  the  closed- 
loop  transfer  functions  with  0(t)  =  0. ,  from  an  exogenous 
input  v(<)  inserted  at  the  control  input  to  the  adaptation 
error  e  and  the  regressor  <f>,  respectively.  Also,  e.(t)  is  the 
tuned  error,  which  in  the  ideal  case  would  be  white  noise 
independent  of  (H^v^.)(t),  and  hence,  the  second  term 
above  is  zero.  In  general,  £.(<)  is  small,  and  so  the  second 
term  can  be  neglected. 

Suppose  that  ^.(f )  is  almost  periodic  with  Fourier  series 
representation 


M0  =  £a(w)£~iW‘  (28) 

wen 

where  Q  is  the  set  of  distinct  Fourier  frequencies  and  a(w) 
the  corresponding  coefficients.  Then,  with  e.(t)  small, 

f ’.)  *  £*(«)»..(-;«•') 

wen 

$(w)  =  o(w)dT(w)  (29) 

Observe  that  for  (24)  to  hold  it  is  necessary  that  R{6.)  is 
non-  singular,  or  eqivalently 

!>(")>  0  (30) 

wen 


This  is  the  usual  condition  for  j.(t)  to  be  persistently  ex¬ 
citing,  sometimes  referred  to  as  the  condition  for  uniform 
identifiability  of  the  parameters. 

In  the  indirect  case  (2),(3)-(4),  the  expression  for  R{6.) 
is  given  by 


*(0.)  =  »vg{*.*r  +  [MHtvd>.)T  +  MJWD]^(0.)) 

(31) 

where  H,v,Hpv  are  closed  loop  transfer  functions  defined 
is  the  same  sense  as  before.  However,  in  the  ideal  indirect 
case,  with  so  unmodeled  dynamics,  both  H„  and  //*,„ 
vanish.  A  derivation  of  the  above  expression  and  further 
discussion  can  be  found  in  Philips  et  al.(1987). 


We  also  remark  that  for  any  adaptive  system,  the  local 
stability  test  matrix  R{6)  can  be  evaluated  at  any  candidi- 
ate  value  of  9,  not  just  0 .  as  defined  in  Theorem  2.  In  this 
case  we  can  insure  stability  in  a  small  neighborhhod  of  the 
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candidate  parameter  value,  provided  that  the  adaptation 
error  e(t)  is  sufficiently  small  at  the  candidate  parameter 
value,  see,  e.g.,  Anderson  et  al.(1986). 

Signal  Dependent  Positivity  Condition 

Since  for  any  square  matrix  M,  we  have  the  inequality 
Re  A (M)  <  (l/2)A(Af  +  MT),  it  follows  that  a  sufficient 
condition  for  (24)  to  hold,  with  R(9.)  given  by  (29),  is  that 

nunAj{£  «(w)ReAr,v(ia-)}  >  0  (32) 

'  uen 

This  is  the  local  stability  condition  first  stated  by  Riedle 
and  Kokotovic  (1985),  and  referred  to  as  a  signal  depen¬ 
dent  positivity  condition.  To  provide  some  interpretation, 
suppose  that  H,v(s)  is  stridy  positive  real  (SPR),  1 

Re  Htv(ju)  >  6 |ff„(ju/)|J  , Vw  €  R  (33) 

where  6  is  a  positive  constant.  Condition  (32)  is  clearly 
significantly  less  restrictive  than  the  SPR  condition  (33), 
which  restricts  R eH,v(ju)  at  each  frequency,  whereas  (32) 
is  a  weighted  sum  of  each  Re  Htv(ju)  contribution,  hence 
the  terminology  mentioned  above.  Local  stability  is  as¬ 
sured  as  long  as  the  contributions  from  those  frequen¬ 
cies  where  R eHtv(jw)  <  0  are  dominated  by  those  where 
KeH,v(ju)  >  0.  Although  (32)  has  a  nice  interpretation, 
in  light  of  (24)  it  is  not  necessary  to  be  so  conservative. 
In  fact,  it  is  easy  to  construct  an  example  where  (32)  fails 
but  (24)  holds  nonetheless. 

Transient  Analysis 

An  understanding  of  the  transient  properties  of  the 
adaptive  system  requires  answering  the  questions  posed 
before,  namely,  determining  the  region  of  attraction  to  a 
small  neighborhood  of  the  tuned  system,  and  the  rate  of 
convergence.  The  following  result,  based  on  averaging,  can 
be  found  in  Anderson  et  al.(1986). 

Theorem  3  Under  the  conditions  stated  in  Theorem  S, 
if  the  initial  parameter  value  6„  is  strictly  inside  a  convex 
subset  of  the  constant  parameter  stability  set,  then  0(f)  will 
remain  in  that  set  and  converge  exponentially  at  a  rate  no 
slower  than  0( 7)  to  the  u.a.s.  almost  periodic  solution 
&,(*)  defined  in  Theorem  l. 

Some  of  the  restrictions  can  be  relaxed,  see,  e.g.,  Riedle 
and  Kokotovic  (1986)  or  Bodson  et  al.(1985).  Specifically, 
the  region  of  attraction  can  be  relaxed  to  a  compact  subset 
of  the  constant  parameter  stability  set;  also,  estimates  of 
the  rate  of  convergence  far  from  the  tuned  setting  do  not 
have  to  be  0(7). 

These  type  of  results  are  pleasing  from  an  intuitive  point 
of  view,  thus  providing  a  qualitative  analysis.  However, 
they  do  not  apply  when  the  parameters  leave  the  constant 
parameter  stability  set,  nor  do  they  provide  quantitative 

*Th«  SPR  condition  is  not  just  putted  out  of  the  air,  it  »ri»e»  u  the 
condition  to  inaurc  boundedneta  of  the  parameter  estimate*,  provided 
there  are  no  unmodeled  dynamics  or  disturbances,  see,  e.g.,  Narendra 
et  al.(1980).  Eaaentially,  the  SPR  condition  forces  the  algorithm  to 
never  move  'uphill*  when  searching  for  the  minimum. 


performance  measures.  In  addition,  we  still  require  slow 
adaptation,  which  in  itself  is  not  too  objectionable,  per¬ 
haps  only  cautious,  but  again  specific  bounds  on  the  adap¬ 
tion  gain  extracted  from  the  theory  tend  to  be  extremely 
conservative.  In  the  next  section  we  show  how  it  may  be 
possible  to  remedy  some  of  these  difficulties 

IV.  FIXED  POINT  ANALYSIS 

In  this  section  we  briefly  discuss  an  alternative  means 
for  analyzing  the  adaptive  system,  which  does  not  have 
to  rely  on  averaging,  We  utilize  the  fixed-point  theorem 
of  Banach  and  Cacciopoli,  referred  to  as  the  contraction 
mapping  principal.  [Actually,  the  adaptive  system  (1)  can 
be  analyzed  by  calling  upon  one  of  several  fixed  point  the¬ 
orems,  Kosut  and  Bitmead  (1986).] 

Following  Hale  (1969),  let  Ad  be  a  subset  of  a  Banach 
space  B  with  norm  ||  •  ||.  If  T  is  an  operator  mapping 
Ad  — *  B,  then  T  is  a  contraction  on  Ad  if  there  is  a  constant 
a  €  [0, 1)  such  that 

lir*  -  ry||  <  <r|]x  -  y||  / Vz,  y  €  Ad  (34) 

The  constant  a  is  referred  to  as  the  contraction  constant 
for  F  on  M.  A  fixed  point  of  T  :  Ad  *-*  M  is  a  point  ( 
function)  x  €  Ad  such  that  x  =  Tx.  We  can  now  state  the 

Contraction  Mapping  Principal  If  M  is  a  closed 
subset  of  a  Banach  space  B  and  I*  :M  — ►  Ad  is  a  contrac¬ 
tion  on  Ad,  then  T  has  a  unique  fixed  point  in  M. 

Referring  to  (20),  we  can  take  T.  as  the  mapping  of  func¬ 
tions  0(f)  into  functions  6(f)  defined  implicitly  as  follows: 

6  =  7/(*.M),  *(0)  =  6(0)  (35) 

i  =  A(0)i>-7?(1,M),  17(0)  =  0  (36) 

Observe  that  fixed  points  of  T  in  Ad,  i.e.,  those  functions 
0  €  Ad  which  satisfy  the  operator  equation 

9  =  T6  (37) 

are  solutions  in  Ad  of  the  parameter  trajectories  of  the 
adaptive  system  (20),  or  equivalently  (1).  For  example, 
the  results  in  Theorem  3  are  arrived  at  by  choosing 

Ad  =  {0  e  C[0,oo)  :  ||0(f)  -  0.||  <  r.  +  rx  exp(-Af)}  (38) 

where  C[0, 00)  is  the  Banach  space  of  continuous  bounded 
functions,  and  r„ri,  and  A  are  positive  constants. 

In  the  process  of  establishing  that  T  is  contractive  on 
Ad,  we  can  utilize  the  method  of  averaging  to  establish 
the  stability,  near  0. ,  of 

*  =  7/M.  0)  (39) 

This  is  the  origin  of  condition  (24).  It  is  important  to 
point  out  that  neither  averaging  nor  small  7  is  required 
to  establish  the  stability  of  (39)  near  0. .  For  example,  if 
the  function  /(l,  0,0)  is  periodic  in  f  uniformly  for  0  in  a 
compact  set,  then  stability  of  (39)  near  0.  can  be  estab¬ 
lished  by  linearization  and  Floquet  Theory,  Hale  (1969). 


Condition  (24)  is  then  replaced  by  a  weaker  condition  and 
the  limitation  on  the  allowable  size  of  the  adaptation  gain 
is  considerably  reduced  over  that  imposed  by  averaging 
theory.  An  early  application  of  Floquet  theory  to  the  lin¬ 
earized  adaptive  system  is  reported  in  a  paper  by  James 
(1971).  A  similar  example  is  given  in  Mareels  et  al.(1986). 

V.  CONCLUDING  REMARKS 

We  have  briefly  described  some  recent  results,  mostly 
contained  in  Anderson  et  al.(1986),  which  pertain  to  the 
stability  analysis  of  adaptive  systems.  As  indicated,  these 
results,  which  are  based  on  the  classical  method  of  aver¬ 
aging,  can  be  extended  to  other  than  slow  adaptation  by 
calling  upon  one  of  several  fixed  point  theorems.  Since 
the  use  of  such  results  is  well  beyond  hand  calculation  it 
is  imperitive  to  develop  the  means  to  numerically  estab¬ 
lish  the  contraction  conditions.  Observe  that  because  the 
contraction  analysis  considers  operators  in  Banach  spaces, 
the  same  results  apply  to  discrete-time  or  hybrid  adaptive 
systems,  i.e.,  any  linear  adaptive  control  of  a  linear  plant. 
Extentions  to  adaptive  nonlinear  systems,  although  simi¬ 
lar  in  principal  to  the  present  analysis,  often  turn  out  to 
have  their  own  dominating  idiosyncrasies,  e.g.,  Workman 
et  al.(1987). 
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Abstract  -  A  generic  parameter  adaptive  system  is 
proposed  and  analyzed  for  the  case  where  the  sys¬ 
tem  is  linear  time  invariant  whenever  the  parame¬ 
ters  are  held  fixed.  Conditions  for  local  convergence 
and  divergence  are  obtained. 


I.  INTRODUCTION 

Uncertainty  underlies  the  reason  for  adaptation.  Some 
physical  processes  defy  practical  mathematical  modeling, 
whereas  others,  with  a  well  defined  structure,  have  uncer¬ 
tain  parameters.  In  both  of  these  instances  it  is  often  the 
case  that  filtering,  prediction,  and  control  systems  can  be 
designed  with  adjustable  parameters  which  can  be  tuned  so 
as  to  accomadate  the  system  uncertainty. 

At  the  present  time  we  stand  at  the  beginning  stages  in 
the  development  of  theory  which  is  directly  applicable  to 
adaptive  systems.  To  cite  one  example,  in  the  past  few 
years  there  has  been  vigorous  activity  and  debate  on  the 
question  of  the  robustness  of  adaptive  control,  i.e.,  what 
happens  when  ideal  conditions  are  violated,  as  would  be 
expected  in  practice.  For  example,  simulations  of  simple 
systems  under  apparently  minor  non-ideal  conditions  have 
shown  degraded  performance  and  even  instabilities,  e.g., 
Rohrs  et  al.(19S5).  Although 

there  is  still  not  as  yet  a  complete  theory  for  the  stability  of 
adaptive  systems,  many  of  the  issues  can  now  be  addressed. 

In  this  paper  we  examine  the  stability  of  adaptive  systems 
from  the  point  of  view  of  parameter  convergence  or  diver¬ 
gence.  We  review  and  extend  some  of  the  results  which 
are  contained  in  the  recently  published  textbook  by  Ander¬ 
son  et  al,(19S6).  These  results  arise  from  the  application 
of  some  of  the  classical  methods  for  analyzing  differential 

*  This  research  supported  partly  by  AFOSR.  con¬ 
tracts  F49C20-S5-C-0094  and  F49620-86-C-0100,  and  NSF 
grant  ECS-SG05C46. 


equations,  e.g.,  linearization,  the  method  of  averaging,  and 
Lyapunov  second  method,  see,  e.g.,  Hale(19G9).  The  ma¬ 
terial  in  the  text  represents  modifications  and  refinements 
of  earlier  work,  specifically:  Astrom  (1983,  1984)  showing 
how  the  method  of  averaging  explains  instabilities  and  drift; 
Rcidle  and  Kokotovic  (1985,  1986)  on  slow  adaptation  and 
the  integral  manifold;  Kosut,  Anderson,  and  Mareels(19S7) 
on  the  relation  between  averaging  and  persistent  excitation; 
Bodson  et  al.(19S6)  on  nonlinear  averaging  analysis  and  de¬ 
termining  the  rate  of  convergence;  and  Kosut  and  Anderson 
(1986),  Kosut  and  Johnson(19S4)  on  linearization  and  lo¬ 
cal  stability.  Stochastic  averaging  methods  have  also  been 
employed  to  study  the  asymptotic  parameter  trajectories  of 
recursive  parameter  estimation  algorithms  in  system  iden¬ 
tification  methods,  e.g.,  Ljung  and  Soderstrom(19S3). 

Averaging:  Uses  and  Limitations 

Although  the  method  of  averaging  provides  a  great  deal  of 
insight  into  the  behavior  of  adaptive  systems,  it  is  not  a 
panacea.  In  the  first  place,  the  method  of  averaging  re¬ 
quires  slow  adaptation  which  can  be  counter-productive  in 
some  instances  because  performance  can  be  below  par  for 
the  long  period  of  time  it  takes  for  the  parameters  to  adjust. 
Secondly,  the  results  are  valid  only  when  the  parameters 
are  restricted  to  a  subset  of  the  constant  parameter  stabil¬ 
ity  set.  Projection  techniques  can  be  employed  to  restrict 
the  parameters  as  required,  but  in  many  practical  cases 
this  is  not  feasible  nor  even  necessary,  e.g.,  in  output  error 
system  identification  and  adaptive  control,  momentary  un¬ 
stable  parameter  settings  can  induce  very  rapid  learning. 
However,  the  exact  mechanism  is  not  well  understood. 

Another  area  of  concern  when  using  the  method  of  averag¬ 
ing  as  an  analysis  tool,  is  that  the  speed  of  adaptation  re¬ 
quired  to  satisfy  the  theoretical  conditions  is  most  often  far 
below  that  as  determined  from  simulations.  Thus,  although 
slow  adaptation  allows  for  an  analysis  which  provides  quan¬ 
titative  measures  of  stability-instability  boundaries,  rate  of 
convergence,  and  robustness  to  unmodeled  effects,  there  are 


a. 

inherent  restrictions  in  the  analysis. 

Transient  Analysis 

To  remove  these  restrictions  requires  understanding  the 
transient  behavior  of  adaptive  systems.  Some  preliminary 
results  are  reported  in  Kosut  et  al.  (1987)  and  Kosut  and 
Bitmead  (198C).  The  tools  for  analysis  involve  a  combi¬ 
nation  of  small  gain  theory,  passivity,  and  the  method  of 
averaging,  with  these  all  linked  together  by  the  Contraction 
Mapping  Principal.  Some  of  these  ideas  will  be  described 
and  expanded  upon  here. 

Beyond  Hand  Calculations 

Although  each  of  these  tools,  in  principal,  involves  straight¬ 
forward  calculations,  even  simple  examples  can  just  barely 
be  worked  out  by  hand.  It  is  clear  that  the  level  of  com¬ 
plexity  of  a  realistic  adaptive  system  is  well  beyond  hand 
calculation.  Hence,  in  order  for  any  of  the  above  mentioned 
analytic  methods  to  be  of  practical  benifit,  it  is  imperitve 
to  develop  “user-friendly”  software  tools  which  provide  the 
requisite  nonlinear  analysis. 

At  the  present  time,  aside  from  simulation  capbility,  there 
are  no  available  software  tools  for  dealing  principally  with 
adaptive  systems,  and  certainly  none  for  more  general  non¬ 
linear  systems.  This  is  a  research  issue  in  both  mathematics 
and  computation,  and  it  is  one  that  is  essential  to  a  con¬ 
tinuing  study  of  adaptive  systems.  This  point  will  not  be 
'  pursued  here,  but  it  is  sometimes  an  overlooked  research 
issue,  and  thus  warrents  more  than  a  passing  remark. 


II.  LINEAR  ADAPTIVE  SYSTEMS 
Adaptive  System  Structure 

The  adaptive  system  we  will  examine  is  shown  in  Figure  2-1. 
This  system  structure  is  valid  for  most  parameter  adaptive 
filtering,  prediction,  and  control  systems,  be  they  continu¬ 
ous,  discrete,  or  hybrid  [see,  e.g.,  Kosut  and  Bitmead(19SG), 

Ljung  and  Soderstrom(1983)l. 

In  this  system  u(t)  and  y(t)  are  the  vectors  of  measured 
inputs  and  outputs,  u;(t)  is  a  vector  of  exogenous  inputs, 
i.e.,  references,  disturbances,  and  noise  sources.  The  vector 
of  adaptive  parameters  is  8{t)  with  initial  value  8Q.  The 
adaptive  parameter  vector  is  mappec  into  a  “control"  pa¬ 
rameter  vector  p(t)  by  some  design  rule,  typically  an  im¬ 
plicitly  defined  memoryless  nonlinear  function  .  For  exam¬ 
ple,  in  control  system  synthesis,  one  can  use  any  number  of 
model  (parameter)  based  methods,  see  e.g.,  Safonov  et  al.( 
1981)  on  LQG  based  designs  or  Vidyasagar(1985)  on  the 
stable  factorization  approach.  In  system  identification,  on 


Figure  2-1.  Parameter  adaptive  control 

the  other  hand,  we  normally  have  a  the  simple  design  rule 
8  —  p.  A  more  extensive  display  of  how  the  various  stan¬ 
dard  adaptive  identification  and  control  systems  fit  into  the 
structure  of  Figure  2-1  is  provided  in  Anderson  et  al.(19S6). 

Parameter  Adaptive  Algorithm 

In  order  to  illustrate  the  basic  ideas,  we  will  assume  here 
that  the  parameter  adaptive  algorithm  is  contiuously  ad¬ 
justed.  In  practice,  parameter  adjustments  would  be  either 
at  discrete  times  or  as  in  the  case  of  adaptive  control,  the 
control  signal  is  the  output  of  a  digital  computer. 

We  will  also  make  the  simplifying  assumption  that  the  plant 
is  linear-time-invariant  with  a  rational  proper  transfer  func¬ 
tion,  and  for  fixed  values  of  the  control  parameter  vector  p, 
so  is  the  block  tabled  control.  Then,  the  parameter  adap¬ 
tive  system  of  Figure  2-1  may  be  described  by  the  coupled 
set  of  ordinary  differential  equations 

i  =  A(0)x  +  B(8)w(t) 

*  =  7  (<)«(*,*.«) 

where  6(t)  6  RF  is  the  adaptive  parameter,  and  x(t)  €  R" 
is  the  system  state,  consisting  of  plant,  controller,  and  filter 
states.  The  matrix  functions  A{8),  B(0)  are  determined  by 
the  design  rule  9  p  and  the  parametric  controller  struc¬ 
ture.  The  nonlinear  function  )  is  determined  by  the 

choice  of  algorithm,  and  7(f)  is  the  speed  of  adjustment, 
often  referred  to  as  the  adaptation  gain.  For  example,  with 
a  simple  gradient  algorithm  7 (t)  «=  7,  *  positive  constant, 
whereas  with  a  recursive  least  squares  (RLS)  type  of  algo¬ 
rithm  7(1)  =  1/t.  Most  algorithms  are  never  used  in  these 
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simplified  forms.  Modifications  include  gain  normalization, 
dead-zones,  leakage,  projection,  and  covariance  resetting, 
to  name  a  few.  The  reasons  for  the  modifications  arise 
from  a  priori  knowiege  of  parameter  ranges  and  the  desire 
to  add  some  robustness  to  the  algorithm.  Obviously  there 
are  a  myriad  of  possibilities. 

In  general,  adaptive  algorithms  share  a  similar  genisis  and 
purpose,  namely  to  adjust  0(f)  so  that  it  asymptotically 
approaches  a  member  of  the  set 

Toyt  =  {6  €  R*  :  avgjs2]  is  a  minimum)  (2.2) 

where  avg(-)  is  defined  by 

avg(.Y)  .  -  J*  X(t)  dt  (2.3) 

In  the  ideal  case,  the  parametrization  can  be  selected  so  as 
to  acheive  what  is  called  perfect  matching,  i.e.,  for  some  re¬ 
stricted  class  of  exogenous  inputs  (usually  no  disturbances), 
and  a  sufficient  number  of  parameters  such  that  there  are 
no  unmodeled  dynamics,  the  set  has  a  single  mem¬ 
ber  such  that  e(t)  =  0.  Under  more  realistic  conditions, 
the  best  that  can  happen  is  that  6(t)  asymptotically  ap¬ 
proaches  a  small  neighborhood  of 

/.=  {(€  R'  :  avg[tfe]  =  0}  (2.4) 


(1)  How  do  the  tuned  parameters  depend  on  the  ex¬ 
ogenous  inputs? 

(2)  Is  the  adaptive  system  stable  in  a  neighborhood 
■c.  of  the  tuned  system,  i.e.,  are  solutions  (x,0)  sta¬ 
ble  near  (x.,0.)?  Furthermore,  how  small  is  this 

_  neighborhood? 

(3)  What  is  the  region  of  attraction  in  (x,  6)  to  a  small 
neighborhood  of  (x.,0.)? 

(4)  What  is  the  rate  of  convergence  to  this  small  neigh¬ 
borhood  of  (x.,0.)? 

Some  of  these  questions  can  be  answered  by  the  method  of 
averaging. 


III.  AVERAGING  ANALYSIS 

The  classical  method  of  averaging  applies  to  a  differential 
equation  of  the  form 

±=7/(‘.*),7>0  (3.1) 

where  7  is  a  positive  constant.  If  7  is  sufficiently  small  then 
under  suitable  regularity  conditions  on  f(t,  x),  the  stabil¬ 
ity  of  the  time- varying  system  (3.1)  is  inherited  from  the 
stability  of  the  simpler  autonomous  system 


If  for  fixed  6,  the  regressor  4(t)  is  constructed  to  be  identical 
with  —Oe(t)/d0,  then  T,  is  the  set  of  all  local  minima 
of  avg[c?].  Since  0e(t)/09  is  a  function  of  the  true,  but 
unknown  plant,  the  regessor  can  at  best  be  constructed 
as  an  approximation.  From  a  practical  point  of  view  it  is 
acceptable  that  the  parameters  approach  and  remain  in  a 
■mail  neighborhood  of  F.,  provided  that  members  of  this 
aet  also  produce  acceptable  performance. 

The  Tuned  System 

Assuming  this  is  so,  let  8,  €  R’  denote  such  a  setting,  of 
which  there  could  be  many.  We  refer  to  each  0.  as  a  tuned 
parameter  and  to  the  corresponding  system 


i  -  l/.(x)  (3.2) 

where 

/«(*)  *  *vg /(•,*) 

1  rT  (3.3) 

1{t'x)dt 

This  system  is  referred  to  as  the  averaged  system.  Observe 
that  for  /a(x)  to  exist,  some  other  restrictions  must  also  ap¬ 
ply  to  /(f,  x),  e.g.,  f(t,  x)  is  almost  periodic  in  t  uniformly 
for  x  in  a  compact  set;  although  weaker  conditions  can  be 
stated. 


x.  =  A(8.)x  +  B(0.)w(t ) 
c.  =  crx. 

=  Dx. 


(2.5) 


as  the  tuned  system  (see,  e.g.,  Kosut  and  Friedlander(  1 985)] 
Clearly  (  2.5)  is  the  same  as  system  (2.1)  but  with  0(f)  fixed 
at  0..  We  can  now  pose  the  following  questions' regarding 
the  adaptive  system  (2.1): 


The  above  type  of  result  is  certainly  expected  on  intuitive 
grounds,  provided  that  /(t,x)  is  smooth  enough  and  the 
average  value  /.(x)  exists.  A  precise  formulation  of  the 
stability-instability  conditions  is  given  in  Theorem  V.3.1, 
Hale(1969). 

In  order  to  apply  the  above  result  to  the  adaptive  system 
(2.1)  it  is  first  neceesary  to  make  a  transformation  of  vari¬ 
ables,  so  that  the  resulting  system  has  the  appropriate  form 


u  expressed  by  (3.1).  The  transformation  is  re¬ 

ferred  to  as  a  time-scale  decomposition. 

Time-Scale  Decomposition 

From  the  previous  discussions  about  the  origins  of  the  adap¬ 
tive  algorithm,  we  are  dearly  interested  in  the  behavior  of 
the  adaptive  system  in  the  neighborhood  of  the  tuned  sys¬ 
tem.  But  the  tuned  constant  parameter  setting  9,  is  not 
known  before  hand.  Hence,  following  the  proceedures  given 
in  Anderson  et  al.(19SC),  we  study  the  behavior  of  (2.1)  in 
the  neighborhood  of  all  constant  parameter  solutions.  For 
this  purpose,  let  i/(t,  0)  denote  the  state  x(t)  when  7(1)  =  0. 
We  refer  to  v(t,  8)  as  the  frozen  parameter  system  state,  or 
frozen  Jtate  for  short.  Hence,  for  each  8  6  Rp,  v(t,8)  sat¬ 
isfies  the  partial  differential  equation 


Ov/dt  =  A(8)v  +  B(8)w(t) 
By  introducing  the  error  state 

»?(t)  =  *(0  -  •'M(O) 


(3.4) 


(3.5) 


the  (x,  0)-system  of  (2.1)  can  be  transformed  into  the  equiv¬ 
alent  (r],6)-  system: 


6  -y(t)f(t,8,v) 

ij  =  A(8)q  -  l(t)g(t,8,ri) 


(3.6) 


Asymptotic  Analysis 

The  definition  of  the  tuned  system  ( x.,8 .)  as  well  as  the 
stability  of  the  adaptive  system  in  the  neighborhood  of  the 
tuned  system  can  be  answered  by  Theorem  4.2  in  Anderson 
et -el. (1086)  or  Section  3  in  Bodson  et  al.(1985).  The  flavor 
of  these  results  can  be  stated  as  follows: 


Theorem  1  Let  the  tuned  parameter  set  be  defined  as  those 
8.  £  R'  which  satisfy 


*vg[/(-A»o)l  =  o 
Re  A[j4(0.)j  <  0 


(3.8) 


If  the  functions  f(t,  0,  ij)  and  g(t ,  8,  tj)  are  sufficiently  smooth., 
almost  periodic  in  t  uniformly  for  8,  tj  in  compact  sets,  then 
for  a  sufficiently  small  adaptation  gain  7,  and  a  sufficiently 
small  peak  value  of  the  tuned  error  signal  £.(t),  solutions 
of  (t.l)  originating  in  a  small  neighborhood  of  the  tuned 
system  (x.,8.)  will  remain  there  if 

mpxRehi[B(8.)}<0  (39) 

and,  moreover,  will  leave  there  if 

max  Re  A, -[£(#.)]  >  0  (3.10) 

where  the  matrix  function  8  r*  B(8)  is  given  by 


where  the  functions  /  and  g  are  given  by 
f(t,8,r))  =  q(v(t,  fl)  +  ij) 

0(t,8,r))  =  [dv(t,8)/d6)f(t,6,rl) 

The  transformation  of  (2.1)  into  (3.6)  is  referred  to  as  a 
time-scale  decomposition,  because  in  general,  0(1)  changes 
much  more  slowly  than  «?(<)•  This  is  certainly  the  case 
whenever  the  adaptation  gain,  7(t),  is  small.  Observe  that 
when  7(t)  s=  1/t,  there  is  always  some  time  t0  >  0  for 
which  7 (t)  is  as  small  as  necessary.  But,  whenever  6(t)  is 
near  convergence,  it  is  also  changing  slowly,  and  hence,  we 
would  expect  the  averaging  results  to  apply  even  though 
the  adaptation  gain  may  not  be  small.  We  will  return  to 
this  point  later. 

For  now,  let  us  assume  that  the  adaptation  gain  is  con¬ 
stant,  but  small,  i.e.,  7(f)  =  7  where  7  is  a  small  positive 
number.  Since  the  (rj,8)-system  of  (3.6)  is  equivalent  to  the 
original  (x,0)-sy»tem  of  (2.1),  the  answers  to  the  questions 
posed  before  will  involve  the  analysis  of  (3.6).  In  particular, 
the  analysis  can  divided  into  answering  questions  about  its 
asymptotic  and  transient  characteristics. 


B(fi)=^avg[/M,0)]  (3.11) 

In  addition,  under  the  conditions  stated  above,  if  (8.9)  holds 
then 

limsup  ||0(t)  -  0.||  =  0(limsup|e.(t)|)  +  0(7) 

1—00  1—00  /j  j2) 

limsup  ||r)(l)||  =  0(7) 

f— *oo 

The  type  of  smoothness  conditions  required  of  the  functions 
f(t,  8,  rj)  and  g(t, 8,  ij)  is  not  severe,  e.g.,  continuity  in  t  and 
Lipschitz  continuity  in  8,tf  for  8,  >/  in  compact  sets. 

The  sharp  stability-instability  boundary  expressed  by  (3  9) 
and  (3.10)  allows  not  only  for  an  assessment  of  a  particular 
design,  but  also  indicates  how  to  modify  and  improve  the 
algorithm. 

It  is  possible  to  obtain  specific  expressions  for  the  right 
hand  sides  of  (3.12),  but  these  are  usually  too  coarse  to 
be  of  practical  quantitative  value.  In  the  next  section  we 


will  discuss  the  underlying  method  for  obtaining  the  above 
results  and  show  that,  in  the  first  place,  averaging  is  not 
necessary,  and  secondly  the  conservatism  in  the  bounds  can 
be  reduced  significantly.  First,  however,  we  address  the 
transient  characteristics. 

Transient  Analysis 

An  understanding  of  the  transient  properties  of  the  adap¬ 
tive  system  requires  answering  the  questions  posed  before, 
namely,  determining  the  region  of  attraction  to  a  small 
neighborhood  of  the  tuned  system,  and  the  rate  of  con¬ 
vergence.  The  folowing  result,  based  on  averaging,  can  be 
found  in  Anderson  et  al.(l'JSG). 

Theorem  2  Under  the  conditions  stated  in  Theorem  1, 
if  the  initial  parameter  value  0„  is  strictly  inside  a  convex 
subset  of  the  constant  parameter  stability  set,  then  6(t )  will 
remain  in  that  set  and  converge  exponentially  at  a  rate  no 
slower  than  0( 7)  to  the  small  neighborhood  of  the  tuned 
system  defined  in  Theorem  1. 

Some  of  the  restrictions  can  be  relaxed,  see,  e.g.,  Riedle 
and  Kokotovic(19S6)  or  Dodson  et  al.(1985).  Specifically, 
the  region  of  attraction  can  be  relaxed  to  a  compact  subset 
of  the  constant  parameter  stability  set;  also,  estimates  of 
the  rate  of  convergence  far  from  the  tuned  setting  do  not 
have  to  be  0(7). 

These  type  of  results  arc  pleasing  from  an  intuitive  point 
of  view,  thus  providing  a  qualitative  analysis.  However, 
they  do  not  apply  when  the  parameters  leave  the  constant 
parameter  stability  set,  nor  do  they  provide  quantitative 
performance  measures.  In  addition,  we  still  require  slow 
adaptation,  which  in  itself  is  not  too  objectionable,  per¬ 
haps  only  cautious,  but  again  specific  bounds  on  the  adap¬ 
tion  gain  extracted  from  the  theory  tend  to  be  extremely 
conservative.  In  the  next  section  we  show  how  it  may  be 
possible  to  remedy  some  of  these  difficulties 


IV.  FIXED  POINT  ANALYSIS 

Here  we  will  dcscibe  a  general  nonlinear  analysis  tool,  namely 
the  fixed-point  theorem  of  Banach  and  Cacciopoli,  referred 
to  as  the  contraction  mapping  principal.  .Among  other  pos¬ 
sibilities,  it  enables  one  to  compute  the  rate  of  convergence 
and  region  of  attraction  for  the  adaptive  system.  [Actually, 
the  adaptive  system  as  represented  by  (3.6)  can  be  analyzed 
by  culling  upon  one  of  several  fixed  point  theorems,  Ivosut 
and  Bitmead(19SC).j 


Following  Hale(19G9),  let  At  be  a  subset  of  a  Banach  space 
B  with  norm  ||  •  ||.  If  T  is  an  operator  mapping  M  — *  B, 
then  r  is  a  contraction  on  h  '  if  there  is  a  constant  p  €  (0,1) 
such  that 

||rx  -  ry||  <  p||x  -  v||  ,Vx, y  €  A1  (4.1) 

The  constant  p  is  referred  to  as  the  contraction  constant  for 
r  on  M.  A  fixed  point  of  T  :  M  <-•  M  is  a  point  (  function) 
x  €  M  such  that  x  =  Fx.  We  can  now  state  the 

Contraction  Mapping  Principal  If  M  is  a  closed  subset 
of  a  Banach  space  13  and  T  :  At  — <  At  is  a  contraction  on 
At,  then  T  has  a  unique  fixed  point  in  At. 

Referring  to  (3.0),  we  can  take  F  as  the  mapping  of  func¬ 
tions  S(t)  into  functions  6(t)  defined  implicitly  as  follows: 


j  =  A(9)i)  -  ig{t,B,rj) 

Observe  that  fixed  points  of  T  in  At,  i.e.,  those  functions 
0  6  At  which  satisfy  the  operator  equation 

6  -  T8  (4.3) 

are  solutions  in  At  of  the  parameter  trajectories  of  the 
adaptive  system  (3.6),  or  equivalently  (2.1).  For  example, 
the  results  (3.9)-(3.12)  are  arrived  at  by  choosing 


At  =  [6  6  CjO,  00)  :  || 0(t)  -  0.||  <  r.  +  ri  exp(-ot)}  (4.4) 

where  C[0,  00)  is  the  Banach  space  of  continuous  bounded 
functions,  and  r»,ri,  and  a  are  positive  constants. 

In  the  process  of  establishing  that  T  is  contractive  on  At, 
we  utilize  the  method  of  averaging  to  establish  the  stability, 
near  6 .,  of 

0  =  7 /(*,0,O)  (4-5) 

This  is  the  origin  of  condition  (3.9).  It  is  important  to 
point  out  that  neither  averaging  nor  small  7  is  required  to 
establish  the  stability  of  (4.5)  near  For  example,  if  the 
function  /(f,0,O)  is  periodic  in  1  uniformly  for  0  in  a  com¬ 
pact  set,  then  stability  of  (4.5)  near  0.  can  be  established 
by  linearization  and  Floquet  Theory,  Hale(l9G0).  Condi¬ 
tion  (3.9)  is  then  replaced  by  a  weaker  condition  and  the 
limitation  on  the  allowable  size  of  the  adaptation  gain  is 


considerably  reduced  over  that  imposed  by  averaging  the¬ 
ory.  An  early  application  of  Floquet  theory  is  reported  in 
a  paper  by  James  (1071),  although  it  was  not  carried  over 
to  the  nonlinear  adaptive  system. 


V.  CONCLUDING  REMARKS 

We  have  briefly  described  some  recent  results,  mostly  con¬ 
tained  in  Anderson  et  al.(lOSC),  which  pertain  to  the  stabil¬ 
ity  analysis  of  adaptive  systems.  As  indicated,  these  results, 
which  are  based  on  the  classical  method  of  averaging,  can 
be  extended  to  other  than  slew  adaptation  by  calling  upon 
one  of  several  fixed  point  theorems.  Since  the  use  of  such 
results  is  well  beyond  hand  calculation  it  is  iinperitive  to 
develop  the  means  to  numerically  establish  the  contraction 
conditions.  Establishing  such  conditions  can  be  viewed  as 
a  canonical  problem  in  analyzing  the  stability  properties  of 
adaptive  systems.  Observe  that  because  the  contraction 
analysis  considers  operators  in  Banach  spaces,  the  same  re¬ 
sults  apply  to  discrete-time  or  hybrid  adaptive  systems,  i.e., 
any  linear  adaptive  control  of  a  linear  plant.  Extentions  to 

adaptive  nonlinear  systems,  although  similar  in  principal 
to  the  present  analysis,  often  turn  out  to  have  their  own 
dominating  idiosyncrasies,  e.g.,  Workman  et  al.(19S7). 
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